| 1 |
PaPaGei: Open Foundation Models for Optical Physiological Signals |
PaPaGei:用于光学生理信号的开放式基础模型,提升PPG信号处理性能。 |
representation learning contrastive learning foundation model |
|
|
| 2 |
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations |
提出基于深度强化学习的微观经济市场生产策略优化方法 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 3 |
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model |
提出QDQ算法,通过一致性模型指导Q值分布学习,解决离线强化学习中的Q值高估问题。 |
reinforcement learning offline reinforcement learning |
|
|
| 4 |
Accelerating Direct Preference Optimization with Prefix Sharing |
提出前缀共享DPO加速方法,提升训练吞吐量且不影响收敛性。 |
DPO direct preference optimization |
✅ |
|
| 5 |
Generator Matching: Generative modeling with arbitrary Markov processes |
Generator Matching:基于任意马尔可夫过程的通用生成建模框架 |
flow matching multimodal |
|
|
| 6 |
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning |
提出基于模型剪枝的图对比学习方法,提升无监督图神经网络预训练性能。 |
contrastive learning |
|
|
| 7 |
Domain Specific Data Distillation and Multi-modal Embedding Generation |
提出一种领域数据蒸馏和多模态嵌入生成方法,提升领域特定属性预测精度。 |
distillation |
|
|
| 8 |
CloudCast -- Total Cloud Cover Nowcasting with Machine Learning |
CloudCast:一种基于U-Net的卷积神经网络,用于云量短期预测。 |
MAE optical flow |
✅ |
|
| 9 |
ThunderKittens: Simple, Fast, and Adorable AI Kernels |
ThunderKittens:一种简单、快速且易于维护的AI Kernel框架,提升GPU利用率。 |
state space model linear attention |
|
|