| 1 |
On the Out-of-Distribution Generalization of Reasoning in Multimodal LLMs for Simple Visual Planning Tasks |
Evaluates how well reasoning in multimodal LLMs generalizes on simple visual planning tasks |
large language model multimodal chain-of-thought |
|
|
| 2 |
MRC-GAT: A Meta-Relational Copula-Based Graph Attention Network for Interpretable Multimodal Alzheimer's Disease Diagnosis |
Proposes MRC-GAT, a meta-relational copula-based graph attention network, for interpretable multimodal Alzheimer's disease diagnosis. |
multimodal |
|
|
| 3 |
ER-MIA: Black-Box Adversarial Memory Injection Attacks on Long-Term Memory-Augmented Large Language Models |
Proposes the ER-MIA framework for black-box adversarial memory injection attacks on long-term memory-augmented large language models. |
large language model |
|
|
| 4 |
Discovering Implicit Large Language Model Alignment Objectives |
Proposes the Obj-Disco framework to address the problem of unclear LLM alignment objectives |
large language model |
|
|
| 5 |
Operationalising the Superficial Alignment Hypothesis via Task Complexity |
Operationalizes the superficial alignment hypothesis in large language models by quantifying task complexity |
large language model instruction following |
|
|
| 6 |
Neural Scaling Laws for Boosted Jet Tagging |
Studies neural scaling laws for jet tagging, revealing the relationship among compute, data, and performance. |
large language model foundation model |
|
|
| 7 |
CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing |
Proposes CrispEdit to address capability preservation in large language model editing |
large language model |
|
|
| 8 |
LLM-as-Judge on a Budget |
Proposes an LLM evaluation method based on multi-armed bandit theory to optimize query allocation |
large language model |
|
|
| 9 |
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities |
Proposes the Prescriptive Scaling method, reveals how language model capabilities evolve with compute, and evaluates its stability. |
foundation model |
|
|
| 10 |
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers |
Proposes Magma, which masks updates to optimize LLM training and significantly improves performance. |
large language model |
|
|