cs.CL (2026-03-13)
📊 14 papers in total | 🔗 3 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (10 🔗3)
Pillar 2: RL & Architecture (3)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (10 papers)
🔬 Pillar 2: RL & Architecture (3 papers)
| # | Title | Key point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation | Proposes WALAR, a method that uses reinforcement learning to improve LLM performance on low-resource multilingual translation. | reinforcement learning, large language model | | |
| 12 | EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning | EvolveCoder evolves test cases through adversarial verification to improve code reinforcement learning. | reinforcement learning, large language model | | |
| 13 | Rethinking Multiple-Choice Questions for RLVR: Unlocking Potential via Distractor Design | Proposes an iterative distractor construction (IDC) framework to improve reasoning on multiple-choice questions in RLVR. | reinforcement learning, large language model | | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | Key point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | Expert Pyramid Tuning: Efficient Parameter Fine-Tuning for Expertise-Driven Task Allocation | Proposes Expert Pyramid Tuning (EPT), which uses a multi-scale feature pyramid to improve parameter-efficient fine-tuning. | manipulation | | |