| 1 |
Predictive Multimodal Modeling of Diagnoses and Treatments in EHR |
提出多模态预测模型,用于电子病历中诊断和治疗的早期预测。 |
predictive model multimodal |
|
|
| 2 |
REFN: A Reinforcement-Learning-From-Network Framework against 1-day/n-day Exploitations |
提出REFN框架,利用强化学习训练LLM自主生成网络过滤器,防御1-day/n-day漏洞攻击。 |
reinforcement learning RLHF distillation |
|
|
| 3 |
eMamba: Efficient Acceleration Framework for Mamba Models in Edge Computing |
eMamba:面向边缘计算的Mamba模型高效加速框架 |
Mamba SSM state space model |
|
|
| 4 |
Multi-Agent Reinforcement Learning for Adaptive Resource Orchestration in Cloud-Native Clusters |
提出基于多智能体强化学习的自适应资源编排方法,解决云原生集群中的资源动态性和调度复杂性问题。 |
reinforcement learning policy learning reward shaping |
|
|
| 5 |
Stabilizing Long-term Multi-turn Reinforcement Learning with Gated Rewards |
提出Gated Reward Accumulation以解决长时程强化学习中的奖励稀疏问题 |
reinforcement learning reward shaping |
|
|
| 6 |
Nonlocal Monte Carlo via Reinforcement Learning |
提出基于强化学习的非局部蒙特卡洛方法,加速组合优化问题求解。 |
reinforcement learning deep reinforcement learning |
|
|
| 7 |
SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation Learning |
SynBrain:提出一种基于概率表示学习的视觉到fMRI合成框架,提升神经解码性能。 |
representation learning |
✅ |
|
| 8 |
CURE: Critical-Token-Guided Re-Concatenation for Entropy-Collapse Prevention |
CURE:一种通过关键Token引导重拼接来防止熵崩溃的强化学习方法 |
reinforcement learning large language model |
✅ |
|
| 9 |
Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis |
提出Retro-Expert,通过协同推理实现可解释的逆合成预测 |
reinforcement learning large language model |
|
|
| 10 |
Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning |
提出方差缩减策略梯度方法,提升多目标强化学习的样本效率 |
reinforcement learning |
|
|
| 11 |
Geospatial Diffusion for Land Cover Imperviousness Change Forecasting |
提出基于地理空间扩散模型的土地覆盖不透水面变化预测方法 |
MAE spatiotemporal |
|
|
| 12 |
Physics-Informed Reward Machines |
提出物理信息奖励机(pRMs),提升强化学习中复杂任务的表达性和学习效率 |
reinforcement learning reward shaping |
|
|