cs.AI(2025-10-02)
📊 共 4 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (2 🔗1)
支柱九:具身大模型 (Embodied Foundation Models) (1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models | 揭示RLVR约束语言模型推理边界的悖论,并提出数据策展算法提升性能 | reinforcement learning large language model | ✅ | |
| 2 | VaPR -- Vision-language Preference alignment for Reasoning | VaPR:通过视觉-语言偏好对齐提升大型视觉语言模型的推理能力 | DPO direct preference optimization |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predicting fMRI Brain Responses to Movies (Algonauts 2025 Report) | 利用多模态堆叠回归预测电影刺激下fMRI脑活动,Seinfeld团队Algonauts 2025挑战赛第十名 | large language model multimodal |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Information Seeking for Robust Decision Making under Partial Observability | InfoSeeker:结合信息搜寻的LLM决策框架,提升部分可观测环境下的决策鲁棒性 | manipulation large language model |