cs.AI(2025-06-17)

📊 共 2 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (1) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
1 Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs 提出可验证奖励的强化学习以提升大型语言模型的推理能力 reinforcement learning large language model chain-of-thought

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
2 Think Clearly: Improving Reasoning via Redundant Token Pruning 通过冗余标记修剪提升推理能力 large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页