cs.CV(2024-05-11)

📊 共 8 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱三:空间感知与语义 (Perception & Semantics) (4 🔗1) 支柱一:机器人控制 (Robot Control) (3 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱三:空间感知与语义 (Perception & Semantics) (4 篇)

#题目一句话要点标签🔗
1 TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization TD-NeRF:提出截断深度先验,用于联合优化相机位姿和神经辐射场 monocular depth NeRF neural radiance field
2 Learning Monocular Depth from Focus with Event Focal Stack 提出基于事件焦点栈的EDFF网络,用于单目景深估计。 monocular depth
3 DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation DeVOS:基于光流引导的可变形Transformer用于视频目标分割 optical flow feature matching
4 Global Motion Understanding in Large-Scale Video Object Segmentation 提出WarpFormer,利用全局运动信息提升大规模视频目标分割的鲁棒性。 optical flow

🔬 支柱一:机器人控制 (Robot Control) (3 篇)

#题目一句话要点标签🔗
5 Direct Learning of Mesh and Appearance via 3D Gaussian Splatting 提出基于3D高斯溅射的网格外观联合学习方法,提升重建效率与质量。 manipulation 3D gaussian splatting 3DGS
6 UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence UniGarmentManip:基于稠密视觉对应的类别级服装操作统一框架 manipulation
7 LogicAL: Towards logical anomaly synthesis for unsupervised anomaly localization LogicAL:面向无监督异常定位的逻辑异常合成方法 manipulation

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
8 Benchmarking Cross-Domain Audio-Visual Deception Detection 提出跨域视听欺骗检测基准,并设计MM-IDGM算法提升泛化性能。 multimodal

⬅️ 返回 cs.CV 首页 · 🏠 返回主页