cs.CL(2025-12-19)
📊 共 18 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (14 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (14 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Reinforcement Learning for Chain of Thought Compression with One-Domain-to-All Generalization | 提出基于强化学习的思维链压缩方法,实现跨领域泛化和效率提升。 | reinforcement learning large language model instruction following | ||
| 16 | ReGal: A First Look at PPO-based Legal AI for Judgment Prediction and Summarization in India | 提出ReGal:一个基于PPO的印度法律AI框架,用于判决预测和摘要生成。 | reinforcement learning PPO | ||
| 17 | Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience | Seed-Prover 1.5:通过经验学习掌握本科水平定理证明 | reinforcement learning large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers | 提出Canon Layers,增强语言模型水平信息流动与推理能力 | manipulation Mamba linear attention |