| 1 |
Random Latent Exploration for Deep Reinforcement Learning |
提出随机潜在空间探索(RLE)算法,提升深度强化学习的探索效率。 |
reinforcement learning deep reinforcement learning |
✅ |
|
| 2 |
Analyzing and Bridging the Gap between Maximizing Total Reward and Discounted Reward in Deep Reinforcement Learning |
提出两种目标对齐方法,解决深度强化学习中总回报与折扣回报差异问题 |
reinforcement learning deep reinforcement learning |
|
|
| 3 |
Event-Triggered Reinforcement Learning Based Joint Resource Allocation for Ultra-Reliable Low-Latency V2X Communications |
提出基于事件触发强化学习的联合资源分配方案,用于超可靠低延迟V2X通信。 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 4 |
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review |
综述:基于强化学习的扩散模型微调方法,优化生物序列生成任务 |
reinforcement learning PPO |
✅ |
|
| 5 |
Model-based Policy Optimization using Symbolic World Model |
提出基于符号世界模型的策略优化方法,提升机器人学习的样本效率 |
reinforcement learning world model |
|
|
| 6 |
Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving Generalization |
提出基于强化学习动态算法配置的实例选择方法,提升泛化性能 |
reinforcement learning deep reinforcement learning |
|
|
| 7 |
Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation |
提出基于RIS辅助的VEC系统,利用DRL优化相移和功率分配,提升车辆边缘计算性能。 |
reinforcement learning deep reinforcement learning DRL |
|
|
| 8 |
A reinforcement learning strategy to automate and accelerate h/p-multigrid solvers |
提出基于强化学习的h/p-多重网格求解器自动优化策略,提升求解效率与稳定性 |
reinforcement learning |
|
|
| 9 |
Optimistic Q-learning for average reward and episodic reinforcement learning |
提出乐观Q学习算法以解决平均奖励强化学习中的后悔最小化问题 |
reinforcement learning |
|
|
| 10 |
Data-Driven Estimation of Conditional Expectations, Application to Optimal Stopping and Reinforcement Learning |
提出一种纯数据驱动的条件期望估计方法,并应用于最优停止和强化学习。 |
reinforcement learning |
|
|
| 11 |
HHGT: Hierarchical Heterogeneous Graph Transformer for Heterogeneous Graph Representation Learning |
提出HHGT:一种用于异构图表示学习的分层异构图Transformer模型 |
representation learning |
|
|
| 12 |
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods |
PG-Rainbow:将分布强化学习融入策略梯度方法,提升Atari游戏性能 |
reinforcement learning |
|
|