| 27 |
Supercharging Thermal Gaussian Splatting with Depth Estimation |
提出基于热红外图像和深度估计的TDg方法,加速并提升3D高斯溅射性能。 |
depth estimation 3D gaussian splatting 3D reconstruction |
|
|
| 28 |
PhyGenHOI: Physically-Aware 4D Generation of Dynamic Human-Object Interactions |
PhyGenHOI:提出物理感知的动态人-物交互4D生成框架 |
3DGS motion diffusion model MDM |
✅ |
|
| 29 |
DGSG-Mind: Dynamic 3D Gaussian Scene Graphs for Long-Term Scene Understanding and Grounding |
DGSG-Mind:用于长期场景理解和定位的动态3D高斯场景图 |
scene reconstruction scene understanding semantic mapping |
✅ |
|
| 30 |
Uncertainty-driven 3D Gaussian Splatting Active Mapping via Anisotropic Visibility Field |
提出基于各向异性可见度场的3D高斯溅射主动建图方法,实现不确定性驱动。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 31 |
From General Vision to Reliable Traversability Estimation: Adapting Vision Foundation Models for Unstructured Outdoor Environments |
ViTA:面向非结构化环境,自适应视觉基础模型的可靠地形可通行性估计 |
traversability foundation model |
|
|
| 32 |
FRUC: Feedforward Dynamic Scene Reconstruction from Uncalibrated Collaborative Driving Views |
FRUC:基于无标定协同驾驶视角的动态场景前馈重建 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 33 |
OmniCD: A Foundational Framework for Remote Sensing Image Change Detection Guided by Multimodal Semantics |
OmniCD:多模态语义引导的遥感图像变化检测基础框架 |
semantic map multimodal |
|
|
| 34 |
City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images |
City-Mesh3R:从多视角图像重建可用于仿真的城市级三维网格模型 |
3D reconstruction gaussian splatting splatting |
|
|
| 35 |
REST3D: Reconstructing Physically Stable 3D Scenes from a Single Image |
REST3D:提出物理约束的单图三维场景重建框架,提升场景物理稳定性。 |
scene understanding penetration human-object interaction |
|
|
| 36 |
Large Depth Completion Model from Sparse Observations |
提出LDCM:基于Transformer的大规模稀疏深度补全模型 |
depth estimation metric depth foundation model |
|
|
| 37 |
Geometry Matters: 3D Foundation Priors for Learning Semantic Correspondence |
提出基于3D先验的语义对应学习框架,提升模型对3D结构的感知能力。 |
sam 3D SAM 3D foundation model |
|
|
| 38 |
MonoPhysics: Estimating Geometry, Appearance, and Physical Parameters from Monocular Videos |
MonoPhysics:单目视频中几何、外观和物理参数的联合估计 |
3D gaussian splatting gaussian splatting splatting |
✅ |
|
| 39 |
Déjà View: Looping Transformers for Multi-View 3D Reconstruction |
Déjà View:循环Transformer用于多视角3D重建,提升效率与性能 |
3D reconstruction |
|
|
| 40 |
Towards Consistent Video Geometry Estimation |
ViGeo:用于视频序列时空一致几何估计的通用前馈模型 |
depth estimation foundation model |
|
|
| 41 |
DVSM: Decoder-only View Synthesis Model Done Right |
DVSM:仅解码器视角合成模型,性能超越传统编码器-解码器结构 |
3DGS foundation model |
|
|
| 42 |
GMOS: Grounding Moving Object Segmentation in 3D Space and Time |
提出GMOS框架以解决移动物体分割中的3D信息缺失问题 |
optical flow |
|
|
| 43 |
BitC-3DGS: High-Capacity 3D Gaussian Splatting Watermarking via Bit Compression |
BitC-3DGS:通过比特压缩实现高容量3D高斯溅射水印 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 44 |
Comparative evaluation of photogrammetric reconstruction methods and 3D Gaussian Splatting for road surface roughness analysis |
比较四种三维重建方法以评估路面粗糙度 |
3D gaussian splatting 3DGS 3D reconstruction |
|
|
| 45 |
DGSG-Mind: Dynamic 3D Gaussian Scene Graphs for Long-Term Scene Understanding and Grounding |
提出DGSG-Mind以解决动态3D场景理解中的实例关联脆弱问题 |
scene reconstruction scene understanding semantic mapping |
✅ |
|
| 46 |
Learning Representations from 3D Gaussian Splats |
评估几何深度学习在3D高斯溅射场景理解中的应用 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 47 |
Déjà View: Looping Transformers for Multi-View 3D Reconstruction |
Déjà View:循环Transformer用于多视角3D重建,提升效率与性能 |
3D reconstruction |
|
|
| 48 |
Towards Consistent Video Geometry Estimation |
ViGeo:提出用于视频序列时空一致几何估计的通用前馈模型 |
depth estimation foundation model |
|
|
| 49 |
VLM3: Vision Language Models Are Native 3D Learners |
VLM3:利用视觉语言模型实现原生3D场景理解 |
depth estimation |
|
|