| 1 |
STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning |
Proposes STU-PID to improve the reasoning efficiency of large language models |
large language model chain-of-thought |
|
|
| 2 |
RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies |
Proposes the RWESummary framework to evaluate large language models on summarizing RWE studies |
large language model foundation model |
|
|
| 3 |
Parallel Continuous Chain-of-Thought with Jacobi Iteration |
Proposes a parallel continuous chain-of-thought method to improve reasoning efficiency |
large language model chain-of-thought |
✅ |
|
| 4 |
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis |
Proposes MedTVT-R1 to address the challenge of multi-disease diagnosis |
large language model multimodal |
✅ |
|
| 5 |
Benchmarking the Pedagogical Knowledge of Large Language Models |
Proposes a pedagogical knowledge benchmark to evaluate the educational capabilities of large language models |
large language model |
|
|
| 6 |
Is There a Case for Conversation Optimized Tokenizers in Large Language Models? |
Proposes conversation-optimized tokenizers to improve the efficiency of large language models |
large language model |
|
|
| 7 |
TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models |
Proposes the TReB benchmark to evaluate the table reasoning capabilities of large language models |
large language model |
|
|
| 8 |
Enhancing Document Retrieval in COVID-19 Research: Leveraging Large Language Models for Hidden Relation Extraction |
Proposes the Covrelex-SE system to improve the efficiency of COVID-19 research literature retrieval |
large language model |
|
|
| 9 |
A Survey of AIOps in the Era of Large Language Models |
Surveys the applications and challenges of large language models in AIOps |
large language model |
|
|
| 10 |
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs |
Proposes L² multilingual unification learning to address test-time reasoning efficiency in large language models |
large language model chain-of-thought |
|
|
| 11 |
Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective |
Proposes the FiSCo framework to address fairness evaluation in LLMs |
large language model |
|
|
| 12 |
OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization |
Proposes the OMEGA benchmark to evaluate the creative capabilities of LLMs in mathematical reasoning |
chain-of-thought |
|
|
| 13 |
CommVQ: Commutative Vector Quantization for KV Cache Compression |
Proposes CommVQ to address the KV cache bottleneck in long-context LLM inference |
large language model |
✅ |
|
| 14 |
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents |
Proposes a reasoning-agent-based deep research approach to improve information retrieval |
large language model |
✅ |
|
| 15 |
Existing LLMs Are Not Self-Consistent For Simple Tasks |
Proposes an inconsistency metric and automated methods to address LLM self-consistency |
large language model |
✅ |
|
| 16 |
The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches |
Proposes a new method for analyzing how large language models understand speech persuasiveness |
large language model |
|
|
| 17 |
Reply to "Emergent LLM behaviors are observationally equivalent to data leakage" |
Clarifies research on self-organization and model-dependent dynamics in LLM populations |
large language model |
|
|
| 18 |
Team LA at SCIDOCA shared task 2025: Citation Discovery via relation-based zero-shot retrieval |
Proposes a relation-based zero-shot retrieval method for citation discovery |
large language model |
|
|