cs.AI(2025-09-04)
📊 共 18 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (6 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning | 提出CoT-Space框架,用强化学习提升LLM的链式思考推理能力 | reinforcement learning large language model chain-of-thought | ✅ | |
| 13 | World Model Implanting for Test-time Adaptation of Embodied Agents | 提出WorMI框架,通过世界模型植入实现具身智能体测试时自适应 | world model embodied AI large language model | ||
| 14 | The Physical Basis of Prediction: World Model Formation in Neural Organoids via an LLM-Generated Curriculum | 利用LLM生成课程,在神经类器官中构建世界模型的物理基础研究 | reinforcement learning world model large language model | ||
| 15 | Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent | 提出Meta-Policy Reflexion,提升LLM Agent在资源受限环境下的跨任务适应性。 | reinforcement learning large language model multimodal | ||
| 16 | Decoupled Entity Representation Learning for Pinterest Ads Ranking | 提出解耦实体表示学习框架,提升Pinterest广告排序效果 | representation learning | ||
| 17 | Hybrid Reinforcement Learning and Search for Flight Trajectory Planning | 结合强化学习与搜索算法,加速飞行轨迹规划,提升紧急情况下的航线重算速度。 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation | EvoEmo:面向多轮价格谈判中对抗性LLM智能体的演化情感策略 | manipulation reinforcement learning large language model |