| 1 |
Universal Battery Degradation Forecasting Driven by Foundation Model Across Diverse Chemistries and Conditions |
Proposes a unified battery degradation forecasting framework to address the challenge of multiple chemistries |
representation learning foundation model |
|
|
| 2 |
Efficient Inference for Inverse Reinforcement Learning and Dynamic Discrete Choice Models |
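Proposes a semiparametric inverse reinforcement learning framework for efficient reward-function inference with statistical guarantees. |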
reinforcement learning inverse reinforcement learning |
|
|
| 3 |
SmartFlow Reinforcement Learning and Agentic AI for Bike-Sharing Optimisation |
SmartFlow: combines reinforcement learning with Agentic AI to optimise dynamic rebalancing of shared bikes |
reinforcement learning large language model |
|
|
| 4 |
GRADE: Replacing Policy Gradients with Backpropagation for LLM Alignment |
GRADE: replaces policy gradients with backpropagation for LLM alignment |
reinforcement learning PPO RLHF |
|
|
| 5 |
Lifting Vision: Ground to Aerial Localization with Reasoning Guided Planning |
Proposes the ViReLoc framework, which uses visual reasoning for ground-to-aerial localization and planning |
contrastive learning multimodal |
|
|
| 6 |
How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns |
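Proposes a fine-grained LLM reasoning benchmark that reveals the underlying reasons for the difference in generalization between SFT and RL fine-tuning |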
reinforcement learning large language model |
|
|
| 7 |
Hyperspherical Graph Representation Learning via Adaptive Neighbor-Mean Alignment and Uniformity |
HyperGRL: a unified framework for graph neural networks based on hyperspherical representation learning |
representation learning |
|
|
| 8 |
Implicit geometric regularization in flow matching via density weighted Stein operators |
Proposes γ-Flow Matching, which achieves geometric regularization of Flow Matching through density-weighted Stein operators. |
flow matching |
|
|
| 9 |
Physics-informed Graph Neural Networks for Operational Flood Modeling |
Proposes DUALFloodGNN, a physics-informed graph neural network for fast flood simulation |
curriculum learning spatiotemporal |
✅ |
|
| 10 |
Stationary Reweighting Yields Local Convergence of Soft Fitted Q-Iteration |
Proposes a Soft FQI algorithm based on stationary reweighting to address the local convergence problem in offline reinforcement learning |
reinforcement learning offline reinforcement learning |
|
|