| 14 |
Teacher-Student Diffusion Model for Text-Driven 3D Hand Motion Generation |
提出TSHaMo:一种用于文本驱动3D手部动作生成的Teacher-Student扩散模型 |
teacher-student motion generation MANO |
|
|
| 15 |
Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving |
Latent-WAM:基于潜在世界行动建模的端到端自动驾驶框架 |
world model world models world action model |
|
|
| 16 |
Le MuMo JEPA: Multi-Modal Self-Supervised Representation Learning with Learnable Fusion Tokens |
Le MuMo JEPA:利用可学习融合令牌的多模态自监督表征学习 |
JEPA representation learning multimodal |
|
|
| 17 |
Toward Physically Consistent Driving Video World Models under Challenging Trajectories |
提出PhyGenesis,解决自动驾驶世界模型在异常轨迹下的物理不一致性问题。 |
world model world models physically plausible |
✅ |
|
| 18 |
RS-SSM: Refining Forgotten Specifics in State Space Model for Video Semantic Segmentation |
提出RS-SSM,通过细化遗忘的特定信息,提升状态空间模型在视频语义分割中的性能。 |
SSM state space model spatiotemporal |
✅ |
|
| 19 |
CAKE: Real-time Action Detection via Motion Distillation and Background-aware Contrastive Learning |
CAKE:基于运动知识蒸馏和背景感知对比学习的实时行为检测 |
contrastive learning distillation optical flow |
|
|
| 20 |
PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning |
PointRFT:用于点云少样本学习的显式强化微调方法 |
reinforcement learning representation learning reward design |
|
|
| 21 |
DecepGPT: Schema-Driven Deception Detection with Multicultural Datasets and Robust Multimodal Learning |
DecepGPT:提出模式驱动的多文化多模态欺骗检测方法,提升鲁棒性与可解释性。 |
distillation multimodal |
|
|
| 22 |
CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition |
CliPPER:用于术中手术长视频事件识别的上下文视频-语言预训练 |
contrastive learning foundation model multimodal |
✅ |
|
| 23 |
Powerful Teachers Matter: Text-Guided Multi-view Knowledge Distillation with Visual Prior Enhancement |
提出文本引导的多视角知识蒸馏,提升视觉教师知识质量 |
distillation |
|
|
| 24 |
SEGAR: Selective Enhancement for Generative Augmented Reality |
SEGAR:用于生成式增强现实的选择性增强框架 |
world model world models |
|
|
| 25 |
Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions |
提出启发式自步学习框架,解决恶劣环境下域自适应语义分割的类别偏置问题 |
reinforcement learning curriculum learning |
|
|