| 22 |
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models |
提出4D LangSplat,通过多模态大语言模型实现动态场景下的4D语言高斯溅射 |
gaussian splatting splatting open-vocabulary |
|
|
| 23 |
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction |
MuDG:利用高斯溅射驯服多模态扩散模型,用于城市场景重建 |
3DGS gaussian splatting splatting |
|
|
| 24 |
VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames |
VicaSplat:单次运行即可从无位姿视频帧中进行3D高斯溅射重建和相机估计 |
3D gaussian splatting gaussian splatting splatting |
✅ |
|
| 25 |
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer |
提出OVTR,首个端到端开放词汇多目标跟踪Transformer模型 |
open-vocabulary open vocabulary multimodal |
✅ |
|
| 26 |
OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions |
OSMa-Bench:提出一个基于LLM/LVLM的自动化流水线,用于评估不同光照条件下的开放语义地图构建算法。 |
semantic mapping semantic map ConceptGraphs |
✅ |
|
| 27 |
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors |
RI3D:利用修复和补全扩散先验的少样本高斯溅射 |
3DGS gaussian splatting splatting |
|
|
| 28 |
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations |
提出Flow-NeRF以解决无先验姿态下的场景重建问题 |
depth estimation NeRF neural radiance field |
✅ |
|
| 29 |
GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mapping |
GaussHDR:通过学习统一的3D和2D局部色调映射实现高动态范围高斯溅射 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 30 |
3D Student Splatting and Scooping |
提出Student Splatting and Scooping (SSS),提升3D高斯溅射的表达能力和参数效率。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 31 |
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds |
提出LHM:基于单张图像的快速可动画人体重建大模型 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 32 |
TARS: Traffic-Aware Radar Scene Flow Estimation |
TARS:交通感知雷达场景流估计,提升自动驾驶感知能力 |
scene understanding scene flow |
|
|
| 33 |
The Power of One: A Single Example is All it Takes for Segmentation in VLMs |
仅需单样本微调,显著提升视觉语言模型在分割任务中的性能 |
open-vocabulary open vocabulary multimodal |
|
|
| 34 |
ROODI: Reconstructing Occluded Objects with Denoising Inpainters |
ROODI:利用去噪修复器重建3D高斯 Splatting中被遮挡物体 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 35 |
ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation |
提出ST-FlowNet,一种高效的脉冲神经网络,用于事件相机光流估计。 |
optical flow |
|
|
| 36 |
MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis |
MouseGPT:用于小鼠行为分析的大规模视觉-语言模型 |
open-vocabulary open vocabulary |
|
|
| 37 |
Speedy MASt3R |
Speedy MASt3R:通过后训练优化加速图像匹配,实现实时3D场景理解 |
scene reconstruction |
|
|