cs.CV（2025-03-10）

📊 共 8 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (4 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (2) 支柱九：具身大模型 (Embodied Foundation Models) (1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
1	CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	CoT-Drive：利用LLM和思维链提示提升自动驾驶运动预测效率	teacher-student distillation scene understanding
2	POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality	POp-GS：基于P-最优性的3D高斯溅射下一最佳视角选择	world model 3D gaussian splatting gaussian splatting
3	AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning	AlphaDrive：通过强化学习和推理释放VLM在自动驾驶中的潜力	reinforcement learning multimodal
4	A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning	提出SlotMIM，提升预训练视觉模型在机器人学习中对非物体中心数据的表征能力	MAE scene understanding	✅

🔬 支柱三：空间感知与语义 (Perception & Semantics) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
5	Multi-Modal 3D Mesh Reconstruction from Images and Text	提出一种语言引导的少样本3D网格重建方法，解决零样本方法依赖预训练3D模型的难题。	gaussian splatting splatting
6	FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction	FunGraph：面向语言提示场景交互的功能感知3D场景图	affordance

🔬 支柱九：具身大模型 (Embodied Foundation Models) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
7	Lightweight Multimodal Artificial Intelligence Framework for Maritime Multi-Scene Recognition	提出轻量级多模态AI框架，用于提升复杂海事场景识别精度与效率。	large language model multimodal

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
8	Temporal Overlapping Prediction: A Self-supervised Pre-training Method for LiDAR Moving Object Segmentation	提出Temporal Overlapping Prediction自监督预训练方法，提升LiDAR点云移动物体分割性能。	spatiotemporal

⬅️ 返回 cs.CV 首页 · 🏠 返回主页