cs.AI(2024-10-03)
📊 共 13 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (9)
支柱二:RL算法与架构 (RL & Architecture) (3 🔗1)
支柱一:机器人控制 (Robot Control) (1 🔗1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | From Imitation to Exploration: End-to-end Autonomous Driving based on World Model | RAMBLE:基于世界模型的端到端自动驾驶,融合模仿学习与强化学习 | reinforcement learning imitation learning world model | ✅ | |
| 11 | SEAL: SEmantic-Augmented Imitation Learning via Language Model | SEAL:通过语言模型增强语义的模仿学习,解决长时决策任务。 | imitation learning large language model | ||
| 12 | LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning | 提出LLaMA-Berry以解决大型语言模型的数学推理能力不足问题 | reinforcement learning RLHF large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series | CAnDOIT:利用观测和干预时序数据进行因果发现,适用于复杂机器人环境 | manipulation | ✅ |