cs.AI(2026-01-22)
📊 共 23 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (17 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (17 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models | 大型语言模型中不确定性量化的演进:从被动指标到主动信号 | reinforcement learning large language model | ||
| 19 | Decoupling Return-to-Go for Efficient Decision Transformer | 提出解耦决策Transformer(DDT),提升离线强化学习效率与性能。 | reinforcement learning offline RL offline reinforcement learning | ||
| 20 | Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning | 提出SigEnt-SAC算法,利用单条专家轨迹实现真实机器人环境中的高效强化学习。 | reinforcement learning SAC VLA | ||
| 21 | Structured Hints for Sample-Efficient Lean Theorem Proving | 利用结构化提示提升精简定理证明的样本效率 | reinforcement learning large language model | ||
| 22 | PhysProver: Advancing Automatic Theorem Proving for Physics | PhysProver:首个物理领域自动定理证明框架,提升物理及数学推理能力。 | reinforcement learning foundation model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 23 | Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning | Cosmos Policy:通过微调视频模型实现视觉运动控制与规划 | manipulation bi-manual bimanual manipulation |