| 1 |
Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation |
提出基于数字孪生监督强化学习的水下自主导航框架 |
reinforcement learning deep reinforcement learning PPO |
|
|
| 2 |
Refining Graphical Neural Network Predictions Using Flow Matching for Optimal Power Flow with Constraint-Satisfaction Guarantee |
提出基于流匹配的图神经网络优化方法,保障约束条件下的最优潮流计算 |
flow matching penetration |
|
|
| 3 |
Bandwidth-constrained Variational Message Encoding for Cooperative Multi-agent Reinforcement Learning |
提出BVME:带宽约束下多智能体强化学习的变分消息编码方法 |
reinforcement learning |
|
|
| 4 |
Adaptive Replay Buffer for Offline-to-Online Reinforcement Learning |
提出自适应回放缓存ARB,解决离线到在线强化学习的数据混合难题 |
reinforcement learning |
|
|
| 5 |
Learning Controllable and Diverse Player Behaviors in Multi-Agent Environments |
提出一种可控且多样化的多智能体行为学习框架,用于游戏AI。 |
reinforcement learning PPO |
|
|