| 10 |
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning |
Proposes a hierarchical reinforcement learning approach to autonomous driving in complex traffic scenarios |
reinforcement learning deep reinforcement learning DRL |
|
|
| 11 |
Reinforcement Learning Controlled Adaptive PSO for Task Offloading in IIoT Edge Computing |
Proposes a reinforcement-learning-controlled adaptive PSO algorithm for task offloading in IIoT edge computing. |
reinforcement learning SAC predictive model |
|
|
| 12 |
Clear Preferences Leave Traces: Reference Model-Guided Sampling for Preference Learning |
Proposes a reference model-guided sampling strategy that improves the quality and efficiency of preference learning data |
preference learning DPO direct preference optimization |
|
|
| 13 |
Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning |
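Proposes a language-informed reinforcement learning model built on neural production systems and memory augmentation, improving zero-shot systematic generalization. |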
reinforcement learning |
|
|
| 14 |
Predictive Modeling and Uncertainty Quantification of Fatigue Life in Metal Alloys using Machine Learning |
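Combines physics-based models with machine learning to improve the accuracy and uncertainty quantification of metal fatigue life prediction |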
predictive model |
|
|
| 15 |
On Accelerating Edge AI: Optimizing Resource-Constrained Environments |
Explores acceleration and optimization strategies for deep learning models in resource-constrained edge AI |
distillation large language model |
|
|
| 16 |
Reliable Pseudo-labeling via Optimal Transport with Attention for Short Text Clustering |
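Proposes the POTA framework, which uses optimal transport and an attention mechanism to generate reliable pseudo-labels and improve short text clustering. |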
representation learning contrastive learning |
✅ |
|
| 17 |
Divergence-Augmented Policy Optimization |
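Proposes DAPO, which uses divergence-augmented policy optimization to improve reinforcement learning performance when reusing offline data. |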
reinforcement learning deep reinforcement learning |
|
|