cs.LG(2025-07-11)
📊 共 23 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (10 🔗2)
支柱九:具身大模型 (Embodied Foundation Models) (8 🔗2)
支柱一:机器人控制 (Robot Control) (4)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (10 篇)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)
🔬 支柱一:机器人控制 (Robot Control) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | SPLASH! Sample-efficient Preference-based inverse reinforcement learning for Long-horizon Adversarial tasks from Suboptimal Hierarchical demonstrations | SPLASH:基于偏好的逆强化学习,从次优分层演示中学习长时对抗任务 | sim-to-real reinforcement learning inverse reinforcement learning | ||
| 20 | Behavioral Exploration: Learning to Explore via In-Context Adaptation | 提出行为探索方法,通过上下文适应学习探索策略,提升机器人自主探索能力。 | locomotion manipulation | ||
| 21 | Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security | 提出量子机器学习安全统一杀伤链模型,应对复杂攻击,促进全面防御。 | manipulation | ||
| 22 | Prediction of Lane Change Intentions of Human Drivers using an LSTM, a CNN and a Transformer | 利用LSTM、CNN和Transformer预测人类驾驶员的变道意图 | motion planning |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 23 | Theory-Informed Improvements to Classifier-Free Guidance for Discrete Diffusion Models | 针对离散扩散模型的无分类器引导理论优化,提升生成质量 | classifier-free guidance |