| 1 |
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration |
提出SHPPO框架以解决多角色动态协作问题 |
reinforcement learning PPO |
|
|
| 2 |
Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback |
提出像素级策略优化算法以解决稀疏奖励问题 |
reinforcement learning diffusion policy |
|
|
| 3 |
Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology |
提出基于Transformer的强化学习方法以提升物联网智能决策能力 |
reinforcement learning PPO |
|
|
| 4 |
Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation |
提出Score Identity Distillation以实现快速生成预训练扩散模型 |
distillation |
✅ |
|
| 5 |
Demonstration Guided Multi-Objective Reinforcement Learning |
提出示范引导的多目标强化学习以解决训练困难问题 |
reinforcement learning |
|
|