cs.AI(2025-06-17)
📊 共 2 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs | 提出可验证奖励的强化学习以提升大型语言模型的推理能力 | reinforcement learning large language model chain-of-thought |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 2 | Think Clearly: Improving Reasoning via Redundant Token Pruning | 通过冗余标记修剪提升推理能力 | large language model |