| 1 |
MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance |
Introduces MM-Food-100K, a multimodal food dataset for improving image-based nutrition prediction. |
multimodal |
|
|
| 2 |
Enhancing GraphQL Security by Detecting Malicious Queries Using Large Language Models, Sentence Transformers, and Convolutional Neural Networks |
Proposes a method for detecting malicious GraphQL queries using LLMs, Sentence Transformers, and CNNs to improve API security. |
large language model |
|
|
| 3 |
Modeling Human Responses to Multimodal AI Content |
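Introduces the MhAIM dataset and the T-Lens system for modeling human responses to multimodal AI-generated content, improving the human-awareness of LLMs. |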
multimodal |
|
|
| 4 |
GenOM: Ontology Matching with Description Generation and Large Language Model |
GenOM: ontology matching with description generation and large language models, improving interoperability in the biomedical domain. |
large language model |
|
|
| 5 |
MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models |
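Proposes MSRS, which aligns attributes in large language models through multi-subspace representation steering and reduces interference between attributes. |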
large language model |
|
|
| 6 |
Reverse Physician-AI Relationship: Full-process Clinical Diagnosis Driven by a Large Language Model |
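Proposes DxDirector-7B, an LLM that drives the full clinical diagnosis process, significantly improving diagnostic accuracy and reducing physician workload. |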
large language model |
|
|
| 7 |
Why Cannot Large Language Models Ever Make True Correct Reasoning? |
A critique arguing that the inherent limitations of large language models prevent them from achieving truly correct reasoning. |
large language model |
|
|
| 8 |
The Knowledge-Reasoning Dissociation: Fundamental Limitations of LLMs in Clinical Natural Language Inference |
Reveals a dissociation between knowledge and reasoning that limits LLMs in clinical natural language inference. |
large language model chain-of-thought |
|
|
| 9 |
MCP-Guard: A Multi-Stage Defense-in-Depth Framework for Securing Model Context Protocol in Agentic AI |
MCP-Guard: a multi-stage defense framework for securing the Model Context Protocol in agentic AI. |
large language model |
|
|
| 10 |
Benchmark Dataset Generation and Evaluation for Excel Formula Repair with LLMs |
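Proposes a method for generating and evaluating a benchmark dataset for Excel formula repair, advancing the use of LLMs in formula error correction. |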
large language model |
|
|
| 11 |
Who Benefits from AI Explanations? Towards Accessible and Interpretable Systems |
Studies design approaches for accessible and interpretable AI systems for visually impaired users. |
multimodal |
|
|
| 12 |
Improving Value-based Process Verifier via Low-Cost Variance Reduction |
Proposes ComMCS, which improves value-based process verifiers through low-cost variance reduction. |
large language model |
|
|
| 13 |
SEQ-GPT: LLM-assisted Spatial Query via Example |
Proposes SEQ-GPT, which uses LLMs to handle complex location search in spatial example queries (SEQ). |
large language model |
|
|
| 14 |
LeanRAG: Knowledge-Graph-Based Generation with Semantic Aggregation and Hierarchical Retrieval |
LeanRAG: a knowledge-graph-based generation framework with semantic aggregation and hierarchical retrieval. |
large language model |
✅ |
|
| 15 |
What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles |
Introduces TurtleSoup-Bench for evaluating the imaginative reasoning of LLMs in information-sparse settings. |
large language model |
|
|