cs.CV（2024-04-06）

📊 共 9 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱三：空间感知与语义 (Perception & Semantics) (5 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (3 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (1)

🔬 支柱三：空间感知与语义 (Perception & Semantics) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Z-Splat: Z-Axis Gaussian Splatting for Camera-Sonar Fusion	提出Z-Splat以解决深度轴缺失锥体问题	gaussian splatting splatting
2	Salient Sparse Visual Odometry With Pose-Only Supervision	提出一种基于姿态监督的显著稀疏视觉里程计以解决环境适应性问题	visual odometry optical flow
3	DATENeRF: Depth-Aware Text-based Editing of NeRFs	提出DATENeRF以解决NeRF场景文本编辑一致性问题	NeRF neural radiance field
4	OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds	提出OmniColor以解决LiDAR与360度相机融合中的相机姿态优化问题	3D reconstruction scene reconstruction	✅
5	Mixed-Query Transformer: A Unified Image Segmentation Architecture	提出混合查询变换器以解决多任务多数据集图像分割问题	open-vocabulary open vocabulary

🔬 支柱九：具身大模型 (Embodied Foundation Models) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
6	Interpretable Multimodal Learning for Cardiovascular Hemodynamics Assessment	提出多模态学习方法以评估心血管血流动力学	multimodal	✅
7	Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement	提出自训练大语言模型以提升视觉程序合成能力	large language model
8	JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups	提出JRDB-Social以解决人类社交行为理解问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
9	On Exploring PDE Modeling for Point Cloud Video Representation Learning	提出基于PDE建模的点云视频表示学习方法以解决时空数据关联问题	representation learning contrastive learning

⬅️ 返回 cs.CV 首页 · 🏠 返回主页