cs.AI(2024-08-13)
📊 13 papers total | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (7 🔗1)
Pillar 2: RL & Architecture (5 🔗1)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (7 papers)
🔬 Pillar 2: RL & Architecture (5 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Personalized Dynamic Difficulty Adjustment -- Imitation Learning Meets Reinforcement Learning | Proposes a personalized dynamic difficulty adjustment method based on imitation learning and reinforcement learning | reinforcement learning, imitation learning | | |
| 9 | Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents | Agent Q: a reasoning and learning framework for autonomous AI agents that combines Monte Carlo tree search with preference optimization | behavior cloning, DPO, direct preference optimization | | |
| 10 | Introduction to Reinforcement Learning | An introductory survey of reinforcement learning that outlines core concepts, methods, and learning resources | reinforcement learning | | |
| 11 | LLMs can Schedule | Uses large language models to solve the Job Shop scheduling problem, with performance comparable to traditional neural methods | reinforcement learning, large language model | | |
| 12 | Multi-Agent Continuous Control with Generative Flow Networks | Proposes MACFN, which achieves cooperative exploration in multi-agent continuous control via generative flow networks | reinforcement learning, flow matching | ✅ | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | EditScribe: Non-Visual Image Editing with Natural Language Verification Loops | EditScribe: non-visual image editing using natural language verification loops | manipulation, multimodal | | |