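| 1 | Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale | Introduces Intern-S1-Pro, the first trillion-parameter scientific multimodal foundation model, improving capabilities in both general and scientific domains. | reinforcement learning, foundation model, multimodal |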
| 2 | Layer-Specific Lipschitz Modulation for Fault-Tolerant Multimodal Representation Learning | Proposes a layer-specific Lipschitz modulation method that improves the robustness of multimodal representation learning under faults. | representation learning, multimodal |
| 3 | Spatiotemporal System Forecasting with Irregular Time Steps via Masked Autoencoder | Proposes the Physics-Spatiotemporal Masked Autoencoder for forecasting high-dimensional spatiotemporal systems with irregular time steps. | masked autoencoder, spatiotemporal |
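| 4 | Cooperative Deep Reinforcement Learning for Fair RIS Allocation | Proposes a fair RIS resource allocation scheme based on cooperative deep reinforcement learning to address load imbalance in multi-cell wireless networks. | reinforcement learning, deep reinforcement learning |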
| 5 | Vision Hopfield Memory Networks | Proposes the Vision Hopfield Memory Network, improving interpretability and data efficiency on vision tasks. | Mamba, foundation model, multimodal |
| 6 | Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes | Proposes Top-K local support matching to address the instability of on-policy distillation for LLMs on long sequences. | distillation, large language model |
| 7 | Offline Decision Transformers for Neural Combinatorial Optimization: Surpassing Heuristics on the Traveling Salesman Problem | Uses offline Decision Transformers to solve the TSP, surpassing traditional heuristic algorithms. | reinforcement learning, offline RL, decision transformer |
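| 8 | The Symmetric Perceptron: a Teacher-Student Scenario | Proposes a teacher-student framework for the symmetric perceptron, addressing the planted inference problem at arbitrary sample density. | teacher-student |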
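| 9 | Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model | Proposes the HIVE framework, which uses online-verified prompt selection to make reinforcement learning training of large reasoning models more efficient. | reinforcement learning, large language model |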