cs.CV(2024-04-06)
📊 共 9 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱三:空间感知与语义 (Perception & Semantics) (5 🔗1)
支柱九:具身大模型 (Embodied Foundation Models) (3 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (1)
🔬 支柱三:空间感知与语义 (Perception & Semantics) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion | 提出Z-Splat以解决深度轴缺失锥体问题 | gaussian splatting splatting | ||
| 2 | Salient Sparse Visual Odometry With Pose-Only Supervision | 提出一种基于姿态监督的显著稀疏视觉里程计以解决环境适应性问题 | visual odometry optical flow | ||
| 3 | DATENeRF: Depth-Aware Text-based Editing of NeRFs | 提出DATENeRF以解决NeRF场景文本编辑一致性问题 | NeRF neural radiance field | ||
| 4 | OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds | 提出OmniColor以解决LiDAR与360度相机融合中的相机姿态优化问题 | 3D reconstruction scene reconstruction | ✅ | |
| 5 | Mixed-Query Transformer: A Unified Image Segmentation Architecture | 提出混合查询变换器以解决多任务多数据集图像分割问题 | open-vocabulary open vocabulary |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Interpretable Multimodal Learning for Cardiovascular Hemodynamics Assessment | 提出多模态学习方法以评估心血管血流动力学 | multimodal | ✅ | |
| 7 | Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement | 提出自训练大语言模型以提升视觉程序合成能力 | large language model | ||
| 8 | JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups | 提出JRDB-Social以解决人类社交行为理解问题 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | On Exploring PDE Modeling for Point Cloud Video Representation Learning | 提出基于PDE建模的点云视频表示学习方法以解决时空数据关联问题 | representation learning contrastive learning |