| 8 |
Reinforcement Learning for Flow-Matching Policies |
提出基于强化学习的Flow-Matching策略,提升通用机器人任务性能 |
reinforcement learning imitation learning flow matching |
|
|
| 9 |
Hierarchical Multi-Agent Reinforcement Learning with Control Barrier Functions for Safety-Critical Autonomous Systems |
提出基于控制屏障函数的分层多智能体强化学习方法,用于安全关键自主系统。 |
reinforcement learning policy learning |
|
|
| 10 |
The Tsetlin Machine Goes Deep: Logical Learning and Reasoning With Graphs |
提出Graph Tsetlin Machine,用于图结构数据的可解释深度逻辑学习与推理。 |
reinforcement learning representation learning multimodal |
|
|
| 11 |
eMargin: Revisiting Contrastive Learning with Margin-Based Separation |
提出eMargin:基于边距分离改进对比学习的时间序列表示 |
representation learning contrastive learning |
✅ |
|
| 12 |
Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling |
Omni-Thinker:通过混合奖励与任务调度扩展LLM中的多任务强化学习 |
reinforcement learning large language model |
|
|