| 19 |
MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning |
提出多域数据混合策略以提升多模态LLM的强化学习能力 |
reinforcement learning large language model multimodal |
|
|
| 20 |
Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization |
提出PANSv2以解决支气管镜定位的鲁棒性与泛化问题 |
Mamba depth estimation foundation model |
|
|
| 21 |
Reinforcing Video Reasoning with Focused Thinking |
提出TW-GRPO以解决视频推理中的无效链条和奖励稀疏问题 |
reinforcement learning spatiotemporal large language model |
✅ |
|
| 22 |
VideoCAD: A Dataset and Model for Learning Long-Horizon 3D CAD UI Interactions from Video |
提出VideoCAD以解决复杂3D CAD界面交互学习问题 |
behavior cloning large language model multimodal |
|
|
| 23 |
ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation |
提出ACM-UNet以解决医疗图像分割中的结构不匹配问题 |
Mamba SSM state space model |
✅ |
|
| 24 |
LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework |
提出LTM3D以解决条件3D生成中的依赖建模问题 |
masked autoencoder 3D gaussian splatting gaussian splatting |
|
|
| 25 |
A Mathematical Perspective On Contrastive Learning |
提出一种数学视角的对比学习框架以解决多模态数据对齐问题 |
contrastive learning multimodal |
|
|
| 26 |
Revisiting Cross-Modal Knowledge Distillation: A Disentanglement Approach for RGBD Semantic Segmentation |
提出CroDiNo-KD以解决RGBD语义分割中的知识蒸馏问题 |
contrastive learning distillation |
|
|
| 27 |
Progressive Class-level Distillation |
提出渐进式类级蒸馏以解决知识蒸馏中的低概率类信息不足问题 |
teacher-student distillation |
|
|
| 28 |
A Cross Branch Fusion-Based Contrastive Learning Framework for Point Cloud Self-supervised Learning |
提出PoCCA框架以提升点云自监督学习效果 |
contrastive learning |
|
|
| 29 |
EgoVIS@CVPR: What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning |
提出状态变化反事实以提升程序意识视频表示学习 |
representation learning |
|
|
| 30 |
Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation |
提出Reason-SVG以解决SVG生成中的推理不足问题 |
reinforcement learning large language model |
|
|
| 31 |
STORK: Faster Diffusion And Flow Matching Sampling By Resolving Both Stiffness And Structure-Dependence |
提出STORK以解决扩散模型和流匹配模型的采样效率问题 |
flow matching |
✅ |
|
| 32 |
State Estimation and Control of Dynamic Systems from High-Dimensional Image Data |
提出一种新型神经架构以解决动态系统状态估计问题 |
reinforcement learning policy learning |
|
|