| 1 |
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning |
提出基于变分偏好学习的个性化RLHF方法,解决用户偏好多样性问题 |
reinforcement learning preference learning RLHF |
|
|
| 2 |
Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models |
提出MolGraph-LarDo框架,融合大语言模型与领域小模型提升分子图表示学习 |
representation learning large language model |
|
|
| 3 |
Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm |
提出一种新型贝叶斯Actor-Critic算法,提升深度强化学习中的高效探索能力。 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 4 |
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models |
提出MOHAWK方法以将Transformer知识蒸馏至子二次模型 |
Mamba SSM state space model |
|
|
| 5 |
Leveraging Superfluous Information in Contrastive Representation Learning |
提出SuperInfo损失函数,通过区分预测性和冗余信息提升对比学习表征 |
representation learning contrastive learning |
|
|
| 6 |
Data Augmentation of Contrastive Learning is Estimating Positive-incentive Noise |
提出基于π-噪声生成器的对比学习数据增强框架,提升模型性能 |
contrastive learning |
|
|
| 7 |
Liquid Fourier Latent Dynamics Networks for fast GPU-based numerical simulations in computational cardiology |
提出Liquid Fourier LDNets,加速计算心脏病学中基于GPU的数值模拟。 |
latent dynamics |
|
|
| 8 |
Structure-enhanced Contrastive Learning for Graph Clustering |
提出结构增强对比学习(SECL)用于提升图聚类性能 |
contrastive learning |
|
|
| 9 |
GARLIC: GPT-Augmented Reinforcement Learning with Intelligent Control for Vehicle Dispatching |
GARLIC:基于GPT增强强化学习的智能车辆调度框架 |
reinforcement learning |
|
|