cs.AI(2025-10-02)

📊 共 4 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (2 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (1) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
1 The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models 揭示RLVR约束语言模型推理边界的悖论,并提出数据策展算法提升性能 reinforcement learning large language model
2 VaPR -- Vision-language Preference alignment for Reasoning VaPR:通过视觉-语言偏好对齐提升大型视觉语言模型的推理能力 DPO direct preference optimization

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
3 Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report) 利用多模态堆叠回归预测电影刺激下fMRI脑活动,Seinfeld团队Algonauts 2025挑战赛第十名 large language model multimodal

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
4 Information Seeking for Robust Decision Making under Partial Observability InfoSeeker:结合信息搜寻的LLM决策框架,提升部分可观测环境下的决策鲁棒性 manipulation large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页