| 1 |
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization |
提出高斯溅射特征场(GSFFs),用于隐私保护的视觉定位。 |
representation learning 3D gaussian splatting 3DGS |
|
|
| 2 |
UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing |
UniLIP:通过自蒸馏和双条件架构,使CLIP具备统一的多模态理解、生成和编辑能力。 |
distillation large language model multimodal |
✅ |
|
| 3 |
Multi-Modal Motion Retrieval by Learning a Fine-Grained Joint Embedding Space |
提出一种多模态运动检索框架,通过学习细粒度联合嵌入空间提升检索性能。 |
contrastive learning text-to-motion motion generation |
|
|
| 4 |
3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding |
3D-R1:通过增强3D视觉语言模型的推理能力实现统一场景理解 |
reinforcement learning RLHF scene understanding |
✅ |
|
| 5 |
Contrastive Learning-Driven Traffic Sign Perception: Multi-Modal Fusion of Text and Vision |
提出基于对比学习的交通标志感知框架,融合文本与视觉信息,提升长尾分布下的识别精度。 |
contrastive learning open-vocabulary open vocabulary |
|
|
| 6 |
Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions |
提出Half-Physics机制,实现SMPL-X模型与环境的物理交互 |
reinforcement learning physically plausible penetration |
|
|
| 7 |
FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning |
FastDriveVLA:提出基于重建的即插即用式Token剪枝,高效端到端自动驾驶。 |
MAE scene understanding vision-language-action |
|
|
| 8 |
VMatcher: State-Space Semi-Dense Local Feature Matching |
VMatcher:结合Mamba和Transformer的状态空间半稠密局部特征匹配 |
Mamba SSM feature matching |
✅ |
|
| 9 |
FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models |
FASTopoWM:利用潜在世界模型的快慢车道线拓扑推理 |
world model scene understanding |
|
|
| 10 |
Mamba-based Efficient Spatio-Frequency Motion Perception for Video Camouflaged Object Detection |
提出基于Mamba的时空频域运动感知网络Vcamba,用于高效视频伪装目标检测。 |
Mamba state space model |
✅ |
|
| 11 |
MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model |
提出MamV2XCalib,一种基于V2X和状态空间模型的无目标基础设施相机标定方法 |
Mamba state space model |
✅ |
|
| 12 |
AGA: An adaptive group alignment framework for structured medical cross-modal representation learning |
提出AGA框架,通过自适应分组对齐实现医学跨模态表征学习 |
representation learning contrastive learning |
|
|
| 13 |
Slot Attention with Re-Initialization and Self-Distillation |
提出DIAS以解决对象中心学习中的冗余和监督问题 |
distillation |
✅ |
|
| 14 |
Beyond Linear Bottlenecks: Spline-Based Knowledge Distillation for Culturally Diverse Art Style Classification |
提出基于样条函数的知识蒸馏方法,提升文化艺术风格分类精度 |
distillation |
|
|
| 15 |
Annotation-Free Reinforcement Learning Query Rewriting via Verifiable Search Reward |
提出RL-QR,一种无需标注的强化学习查询重写框架,提升RAG系统检索性能。 |
reinforcement learning |
|
|