cs.CV（2024-05-11）

📊 共 8 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱三：空间感知与语义 (Perception & Semantics) (4 🔗1) 支柱一：机器人控制 (Robot Control) (3 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (1)

🔬 支柱三：空间感知与语义 (Perception & Semantics) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
1	TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization	TD-NeRF：提出截断深度先验，用于联合优化相机位姿和神经辐射场	monocular depth NeRF neural radiance field	✅
2	Learning Monocular Depth from Focus with Event Focal Stack	提出基于事件焦点栈的EDFF网络，用于单目景深估计。	monocular depth
3	DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation	DeVOS：基于光流引导的可变形Transformer用于视频目标分割	optical flow feature matching
4	Global Motion Understanding in Large-Scale Video Object Segmentation	提出WarpFormer，利用全局运动信息提升大规模视频目标分割的鲁棒性。	optical flow

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
5	Direct Learning of Mesh and Appearance via 3D Gaussian Splatting	提出基于3D高斯溅射的网格外观联合学习方法，提升重建效率与质量。	manipulation 3D gaussian splatting 3DGS
6	UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence	UniGarmentManip：基于稠密视觉对应的类别级服装操作统一框架	manipulation	✅
7	LogicAL: Towards logical anomaly synthesis for unsupervised anomaly localization	LogicAL：面向无监督异常定位的逻辑异常合成方法	manipulation

🔬 支柱九：具身大模型 (Embodied Foundation Models) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
8	Benchmarking Cross-Domain Audio-Visual Deception Detection	提出跨域视听欺骗检测基准，并设计MM-IDGM算法提升泛化性能。	multimodal

⬅️ 返回 cs.CV 首页 · 🏠 返回主页