| 17 |
MrGS: Multi-modal Radiance Fields with 3D Gaussian Splatting for RGB-Thermal Novel View Synthesis |
MrGS:基于3D高斯溅射的多模态辐射场,用于RGB-热红外新视角合成 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 18 |
Geometry-Consistent 4D Gaussian Splatting for Sparse-Input Dynamic View Synthesis |
提出GC-4DGS,通过几何一致性提升稀疏输入下动态场景的4D高斯溅射渲染质量。 |
monocular depth gaussian splatting splatting |
|
|
| 19 |
HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model |
HMR3D:利用大型视觉语言模型进行3D场景理解的分层多模态表示 |
scene understanding spatial relationship multimodal |
|
|
| 20 |
DenseScan: Advancing 3D Scene Understanding with 2D Dense Annotation |
DenseScan:利用2D密集标注提升3D场景理解能力 |
scene understanding spatial relationship large language model |
|
|
| 21 |
FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting |
FACT-GS:频率对齐的复杂度感知纹理重参数化高斯溅射,提升渲染质量。 |
gaussian splatting splatting |
|
|
| 22 |
See, Rank, and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection |
提出基于重要词感知的视频片段过滤方法,用于视频时刻检索和高光检测。 |
scene understanding large language model multimodal |
✅ |
|
| 23 |
Image Valuation in NeRF-based 3D reconstruction |
提出一种图像价值评估方法,用于优化NeRF三维重建的图像选择。 |
NeRF neural radiance field scene reconstruction |
|
|
| 24 |
Robust 3DGS-based SLAM via Adaptive Kernel Smoothing |
提出基于自适应核平滑的鲁棒3DGS-SLAM,提升相机位姿跟踪精度 |
3DGS scene reconstruction |
|
|
| 25 |
SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models |
提出SpaceMind,通过相机引导的多模态融合增强视觉-语言模型中的空间推理能力 |
VGGT large language model multimodal |
|
|
| 26 |
DenoiseGS: Gaussian Reconstruction Model for Burst Denoising |
DenoiseGS:利用高斯重建模型实现高效的Burst图像去噪 |
3D gaussian splatting gaussian splatting splatting |
✅ |
|
| 27 |
Taming the Light: Illumination-Invariant Semantic 3DGS-SLAM |
提出光照不变语义3DGS-SLAM,解决极端光照下SLAM系统性能退化问题 |
3DGS |
|
|
| 28 |
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation |
DualCamCtrl:用于几何感知相机控制视频生成的双分支扩散模型 |
scene understanding |
✅ |
|