cs.AI (2026-01-28)
📊 18 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (13)
Pillar 2: RL & Architecture (4 🔗1)
Pillar 5: Interaction & Reaction (1)
🔬 Pillar 9: Embodied Foundation Models (13 papers)
🔬 Pillar 2: RL & Architecture (4 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | Endogenous Reprompting: Self-Evolving Cognitive Alignment for Unified Multimodal Models | Proposes Endogenous Reprompting to bridge the cognitive gap between understanding and generation in unified multimodal models. | reinforcement learning, curriculum learning, multimodal | | |
| 15 | CtrlCoT: Dual-Granularity Chain-of-Thought Compression for Controllable Reasoning | CtrlCoT: a dual-granularity chain-of-thought compression framework for controllable reasoning. | distillation, chain-of-thought | ✅ | |
| 16 | PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs | PathWise: automated heuristic design using a world model and self-evolving LLMs. | world model, large language model | | |
| 17 | Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning | Proposes a tool-integrated RL framework that improves the scalability and reliability of medical reasoning verification. | reinforcement learning, large language model | | |
🔬 Pillar 5: Interaction & Reaction (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
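| 18 | Normative Equivalence in human-AI Cooperation: Behaviour, Not Identity, Drives Cooperation in Mixed-Agent Groups | In human-AI cooperation, behaviour rather than identity drives cooperation: a study of normative equivalence in mixed-agent groups. | dyadic interaction | | |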