| 1 |
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning |
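Proposes the Constraint-Adaptive Policy Switching (CAPS) framework to address adaptation to changing constraints in offline safe reinforcement learning |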
reinforcement learning offline RL |
✅ |
|
| 2 |
Elucidating Flow Matching ODE Dynamics with Respect to Data Geometries and Denoisers |
Theoretically analyzes flow matching ODE dynamics, revealing the roles played by data geometry and denoisers |
flow matching |
|
|
| 3 |
Effective and Lightweight Representation Learning for Link Sign Prediction in Signed Bipartite Graphs |
Proposes ELISE, an effective and lightweight method for link sign prediction in signed bipartite graphs |
representation learning |
|
|
| 4 |
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL |
Proposes optimistic critic reconstruction and constrained fine-tuning for general offline-to-online reinforcement learning |
reinforcement learning offline RL |
|
|