cs.CL (2025-12-23)
📊 15 papers in total | 🔗 4 with code
🎯 Navigation by Interest Area
Pillar 9: Embodied Foundation Models (8 🔗1)
Pillar 2: RL & Architecture (6 🔗3)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (8 papers)
🔬 Pillar 2: RL & Architecture (6 papers)
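| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | FaithLens: Detecting and Explaining Faithfulness Hallucination | Proposes FaithLens to detect and explain faithfulness hallucinations in large language models. | reinforcement learning large language model | | |
| 10 | Fun-Audio-Chat Technical Report | Fun-Audio-Chat: improves the performance of large speech-interaction models through dual-resolution speech representations and Core-Cocktail training. | DPO instruction following | | |
| 11 | Multi-hop Reasoning via Early Knowledge Alignment | Proposes an Early Knowledge Alignment (EKA) module that improves the performance and efficiency of multi-hop reasoning in iterative RAG. | reinforcement learning large language model | ✅ | |
| 12 | Memory-T1: Reinforcement Learning for Temporal Reasoning in Multi-session Agents | Proposes the Memory-T1 framework, which uses reinforcement learning to tackle temporal reasoning in multi-turn dialogue agents. | reinforcement learning | ✅ | |
| 13 | Distilling to Hybrid Attention Models via KL-Guided Layer Selection | Proposes a KL-divergence-guided layer selection method for distilling softmax-attention Transformers into hybrid attention models. | linear attention distillation | | |
| 14 | SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision | SpidR: learns fast and stable speech units for spoken language models without supervision. | representation learning distillation | ✅ | |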
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | AprielGuard | Proposes AprielGuard, which unifies safety risks and adversarial threats to strengthen LLM safeguards. | manipulation large language model | | |