cs.LG (2024-10-15)

📊 4 papers in total

🎯 Interest areas

Pillar 2: RL Algorithms and Architecture (3) · Pillar 1: Robot Control (1)

🔬 Pillar 2: RL Algorithms and Architecture (3 papers)

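| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 1 | DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting | DODT: enhances online Decision Transformer learning through Dreamer's actor-critic trajectory forecasting | reinforcement learning, world model, dreamer |
| 2 | DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation | DIAR: diffusion-model-guided implicit Q-learning with adaptive revaluation, addressing long-horizon decision-making in offline reinforcement learning | reinforcement learning, offline RL, offline reinforcement learning |
| 3 | Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning | Proposes DUSDi, which learns disentangled skills to make hierarchical reinforcement learning more efficient | reinforcement learning |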

🔬 Pillar 1: Robot Control (1 paper)

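| # | Title | One-sentence summary | Tags |
|---|-------|----------------------|------|
| 4 | Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions | SAVO: mitigates the suboptimality of deterministic policy gradients in complex Q-functions through multiple action proposals and Q-function approximation | locomotion, manipulation, dexterous manipulation |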
