| 1 |
Large-Scale 3D Ground-Motion Synthesis with Physics-Inspired Latent Operator Flow Matching |
提出GMFlow,利用物理启发的潜在算子流匹配实现大规模3D地面运动合成。 |
flow matching motion synthesis spatiotemporal |
|
|
| 2 |
Federated Distributional Reinforcement Learning with Distributional Critic Regularization |
提出TR-FedDistRL,解决联邦强化学习中值函数平均导致的安全问题。 |
reinforcement learning multimodal |
|
|
| 3 |
Flow Matching Policy with Entropy Regularization |
提出流匹配策略与熵正则化以解决强化学习中的探索问题 |
reinforcement learning diffusion policy flow matching |
|
|
| 4 |
Efficient Exploration at Scale |
提出一种高效在线强化学习算法,利用少量人工反馈数据显著提升LLM性能。 |
reinforcement learning offline RL RLHF |
|
|
| 5 |
Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics |
提出ATMOS,基于状态空间模型生成生物分子动力学的原子级轨迹,加速药物发现。 |
SSM state space model |
|
|
| 6 |
Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control |
提出GuidedSAC以解决连续控制中的高效探索问题 |
reinforcement learning SAC large language model |
|
|
| 7 |
DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis |
DSS-GAN:首个采用Mamba骨干网络的条件图像生成对抗网络,提升图像合成质量。 |
Mamba |
|
|
| 8 |
Benchmarking Reinforcement Learning via Stochastic Converse Optimality: Generating Systems with Known Optimal Policies |
提出基于随机逆最优性的强化学习基准测试框架,生成已知最优策略的系统。 |
reinforcement learning |
|
|
| 9 |
Complementary Reinforcement Learning |
提出互补强化学习,解决Agent在稀疏奖励下经验利用不足的问题 |
reinforcement learning |
|
|
| 10 |
Causal Representation Learning on High-Dimensional Data: Benchmarks, Reproducibility, and Evaluation Metrics |
针对因果表示学习,提出基准测试、可复现性分析及综合评估指标。 |
representation learning |
|
|
| 11 |
Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs |
提出基于算子理论的策略梯度方法,解决一般MDP中无界代价问题 |
reinforcement learning PPO |
|
|