| 1 |
QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training |
Proposes QoQ-Med to address data imbalance in multimodal clinical decision-making |
reinforcement learning foundation model multimodal |
✅ |
|
| 2 |
A Brain Graph Foundation Model: Pre-Training and Prompt-Tuning for Any Atlas and Disorder |
Proposes a brain graph foundation model to handle the diversity of the neuroscience domain |
masked autoencoder contrastive learning large language model |
✅ |
|
| 3 |
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning |
Proposes MMedAgent-RL to address collaboration in multimodal medical reasoning |
reinforcement learning curriculum learning multimodal |
|
|
| 4 |
From Rules to Rewards: Reinforcement Learning for Interest Rate Adjustment in DeFi Lending |
Applies offline reinforcement learning to optimize interest rate adjustment in DeFi lending |
reinforcement learning TD3 offline reinforcement learning |
|
|
| 5 |
Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning |
Proposes AdaPR to address adaptability in 4D flow MRI reconstruction |
reinforcement learning deep reinforcement learning DRL |
|
|
| 6 |
Prompt-Tuned LLM-Augmented DRL for Dynamic O-RAN Network Slicing |
Proposes a prompt-tuned, LLM-augmented DRL method for dynamic O-RAN network slicing |
reinforcement learning deep reinforcement learning DRL |
|
|
| 7 |
A New Spatiotemporal Correlation Anomaly Detection Method that Integrates Contrastive Learning and Few-Shot Learning in Wireless Sensor Networks |
Proposes MTAD-RD to address sample scarcity in anomaly detection for wireless sensor networks |
contrastive learning spatiotemporal |
|
|
| 8 |
Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning |
Proposes a nonlinear attention mechanism to accelerate convergence in reinforcement learning |
reinforcement learning linear attention |
|
|
| 9 |
RLAE: Reinforcement Learning-Assisted Ensemble for LLMs |
Proposes RLAE to address dynamic weight adjustment in LLM ensembles |
reinforcement learning PPO large language model |
|
|
| 10 |
Dynamic Domain Adaptation-Driven Physics-Informed Graph Representation Learning for AC-OPF |
Proposes DDA-PIGCN to address constraint modeling in AC-OPF |
representation learning MAE spatiotemporal |
|
|
| 11 |
Optimized Local Updates in Federated Learning via Reinforcement Learning |
Optimizes local updates in federated learning via reinforcement learning |
reinforcement learning deep reinforcement learning DRL |
✅ |
|
| 12 |
ORAN-GUIDE: RAG-Driven Prompt Learning for LLM-Augmented Reinforcement Learning in O-RAN Network Slicing |
Proposes ORAN-GUIDE to address dynamic resource allocation in O-RAN network slicing |
reinforcement learning deep reinforcement learning DRL |
|
|
| 13 |
Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments |
Proposes a new metric learning method to cope with distractions in reinforcement learning |
reinforcement learning deep reinforcement learning |
|
|
| 14 |
Reinforcement Learning for Hanabi |
Explores the application and performance of reinforcement learning in the game Hanabi |
reinforcement learning deep reinforcement learning |
|
|
| 15 |
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries |
Proposes CLARIFY to address preference-based reinforcement learning with ambiguous queries |
reinforcement learning contrastive learning |
|
|
| 16 |
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn |
Proposes C-CHAIN, which reduces churn to mitigate plasticity loss in continual reinforcement learning |
reinforcement learning |
|
|
| 17 |
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs |
Proposes AutoMixAlign to address multi-task preference optimization |
DPO large language model |
|
|
| 18 |
Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control |
Compares traditional and reinforcement learning methods for energy storage control |
reinforcement learning |
|
|