cs.LG(2025-06-02)
📊 共 4 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱一:机器人控制 (Robot Control) (2 🔗1)
支柱九:具身大模型 (Embodied Foundation Models) (1)
支柱二:RL算法与架构 (RL & Architecture) (1)
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Leveraging Analytic Gradients in Provably Safe Reinforcement Learning | 提出首个有效的分析梯度安全强化学习保障方法 | sim-to-real reinforcement learning differentiable simulation | ✅ | |
| 2 | Trajectory First: A Curriculum for Discovering Diverse Policies | 提出基于轨迹的课程以提升多样性策略学习 | manipulation reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics | 提出SmolVLA以解决现有VLA模型的高成本问题 | vision-language-action VLA multimodal |
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Bregman Centroid Guided Cross-Entropy Method | 提出Bregman质心引导的交叉熵方法以解决多模态优化问题 | reinforcement learning multimodal |