cs.LG(2026-04-22)
📊 共 21 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (8)
支柱一:机器人控制 (Robot Control) (2 🔗1)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning | 提出Occupancy Reward Shaping,改善离线目标条件强化学习中的信用分配问题 | locomotion manipulation reinforcement learning | ✅ | |
| 20 | Distributional Value Estimation Without Target Networks for Robust Quality-Diversity | QDHUAC:一种无目标网络的分布价值估计方法,用于提升质量多样性算法的鲁棒性 | locomotion reinforcement learning |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 21 | Physics-Conditioned Synthesis of Internal Ice-Layer Thickness for Incomplete Layer Traces | 提出物理条件约束的冰层厚度合成方法,补全雷达图像中不完整的冰层信息 | physically plausible |