| 1 |
Deep Reinforcement Learning for Real-Time Green Energy Integration in Data Centers |
提出基于深度强化学习的能源管理系统,优化数据中心绿色能源实时集成 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 2 |
Market Making Strategies with Reinforcement Learning |
提出基于强化学习的市场做市策略,解决库存风险和非平稳市场动态问题 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 3 |
Revisiting Bisimulation Metric for Robust Representations in Reinforcement Learning |
提出改进的双仿射度量,提升强化学习中鲁棒表征的质量与适应性。 |
reinforcement learning representation learning |
|
|
| 4 |
Optimizing Metachronal Paddling with Reinforcement Learning at Low Reynolds Number |
利用强化学习优化低雷诺数下的后摆运动,探索最优划水策略 |
reinforcement learning |
|
|
| 5 |
Even Faster Simulations with Flow Matching: A Study of Zero Degree Calorimeter Responses |
利用Flow Matching加速零度量能器响应模拟,实现高能物理领域快速仿真 |
flow matching |
✅ |
|
| 6 |
C2G-KD: PCA-Constrained Generator for Data-Free Knowledge Distillation |
提出C2G-KD,一种基于PCA约束生成器的数据自由知识蒸馏框架 |
distillation |
|
|
| 7 |
GLANCE: Graph Logic Attention Network with Cluster Enhancement for Heterophilous Graph Representation Learning |
提出GLANCE,通过逻辑推理、动态图精炼和自适应聚类增强异质图表示学习。 |
representation learning |
|
|
| 8 |
Efficient Uncertainty in LLMs through Evidential Knowledge Distillation |
提出基于证据知识蒸馏的高效LLM不确定性量化方法 |
distillation |
|
|
| 9 |
Group Sequence Policy Optimization |
提出GSPO算法,通过序列级策略优化提升大型语言模型强化学习训练的稳定性与效率。 |
reinforcement learning large language model |
|
|
| 10 |
Hybrid quantum-classical algorithm for near-optimal planning in POMDPs |
提出QBRL算法,加速部分可观测马尔可夫决策过程中的近优规划。 |
reinforcement learning model-based RL |
|
|