| 1 |
DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models |
提出DA-Code基准,用于评估LLM在Agent数据科学代码生成任务中的能力。 |
large language model |
|
|
| 2 |
Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models |
提出Astute RAG,解决大语言模型检索增强生成中不完美检索和知识冲突问题 |
large language model |
|
|
| 3 |
MoDEM: Mixture of Domain Expert Models |
提出MoDEM:结合领域提示路由与领域专家模型,提升大语言模型性能与效率。 |
large language model |
|
|
| 4 |
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning |
揭示C4数据集并非LLM剪枝最优选择,提出校准数据选择对剪枝性能影响的关键性。 |
chain-of-thought |
✅ |
|
| 5 |
Exploring Prompt Engineering: A Systematic Review with SWOT Analysis |
对大型语言模型中Prompt工程技术进行SWOT分析与系统性综述 |
large language model |
|
|
| 6 |
Better Language Models Exhibit Higher Visual Alignment |
研究表明,更优的语言模型表现出更高的视觉对齐能力,并提出高效的视觉-语言融合方法ShareLock。 |
large language model |
|
|
| 7 |
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning |
SimNPO:通过简化负偏好优化解决LLM的不可学习性问题。 |
large language model |
✅ |
|