| 11 |
GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents |
GameplayQA:提出用于评估3D虚拟智能体决策密集型第一视角多视频理解的基准框架。 |
world model world models embodied AI |
|
|
| 12 |
Self-Distillation for Multi-Token Prediction |
提出MTP-D自蒸馏方法,提升LLM多Token预测的效率和接受率。 |
distillation large language model |
|
|
| 13 |
Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding |
提出基于嵌入的表征学习方法,分析辅导对话中的时间动态支架。 |
representation learning large language model |
|
|
| 14 |
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? |
揭示自蒸馏对大语言模型推理能力的负面影响 |
distillation |
|
|
| 15 |
Perturbation: A simple and efficient adversarial tracer for representation learning in language models |
提出Perturbation:一种简单高效的对抗追踪器,用于语言模型中的表征学习 |
representation learning |
|
|
| 16 |
MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination |
提出MARCH,利用多智能体强化自检解决LLM幻觉问题 |
reinforcement learning large language model |
✅ |
|
| 17 |
CoCR-RAG: Enhancing Retrieval-Augmented Generation in Web Q&A via Concept-oriented Context Reconstruction |
提出CoCR-RAG,通过概念重构增强Web问答中的检索增强生成效果。 |
distillation large language model |
|
|
| 18 |
Improving Lean4 Autoformalization via Cycle Consistency Fine-tuning |
通过循环一致性微调提升Lean4自动形式化能力 |
reinforcement learning curriculum learning |
|
|