| 1 |
Foundation Models for Structural Health Monitoring |
提出Transformer神经网络作为结构健康监测的基础模型 |
MAE distillation foundation model |
|
|
| 2 |
Rethinking Teacher-Student Curriculum Learning through the Cooperative Mechanics of Experience |
通过合作机制重新思考教师-学生课程学习 |
reinforcement learning curriculum learning teacher-student |
|
|
| 3 |
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning |
提出网格映射伪计数约束以解决离线强化学习中的OOD问题 |
reinforcement learning SAC offline reinforcement learning |
|
|
| 4 |
AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset |
提出AD4RL以解决离线强化学习在自动驾驶中的数据不足问题 |
reinforcement learning offline reinforcement learning |
|
|
| 5 |
Solving a Real-World Optimization Problem Using Proximal Policy Optimization with Curriculum Learning and Reward Engineering |
提出基于PPO的课程学习与奖励工程以优化废物分类问题 |
reinforcement learning PPO curriculum learning |
|
|
| 6 |
Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation |
提出可解释的强化学习方法以优化机械通气控制 |
reinforcement learning behavior cloning |
|
|
| 7 |
MARL-LNS: Cooperative Multi-agent Reinforcement Learning via Large Neighborhoods Search |
提出MARL-LNS以解决多智能体强化学习训练效率低下问题 |
reinforcement learning |
|
|
| 8 |
Model-based Reinforcement Learning for Parameterized Action Spaces |
提出DLPA算法以解决参数化动作空间中的强化学习问题 |
reinforcement learning |
|
|
| 9 |
Linear Attention Sequence Parallelism |
提出线性注意力序列并行方法以提升长序列处理效率 |
linear attention |
✅ |
|
| 10 |
Reinforcement Learning in Categorical Cybernetics |
将强化学习算法纳入范畴控制论框架以提升学习效率 |
reinforcement learning |
|
|
| 11 |
Convergence Analysis of Flow Matching in Latent Space with Transformers |
提出流匹配方法以确保ODE生成模型的收敛性 |
flow matching |
|
|
| 12 |
Masked Completion via Structured Diffusion with White-Box Transformers |
提出CRATE-MAE以解决无监督表示学习中的结构化问题 |
representation learning masked autoencoder MAE |
✅ |
|
| 13 |
Improve Knowledge Distillation via Label Revision and Data Selection |
通过标签修正与数据选择提升知识蒸馏效果 |
distillation |
|
|
| 14 |
Generative-Contrastive Heterogeneous Graph Neural Network |
提出生成对比异构图神经网络以解决数据增强不足问题 |
masked autoencoder contrastive learning |
|
|