| 1 |
Mirror-3DGS: Incorporating Mirror Reflections into 3D Gaussian Splatting |
Proposes Mirror-3DGS to model mirror reflections |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 2 |
MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements |
Proposes MM3DGS for multi-modal map representation in SLAM |
3D gaussian splatting 3DGS gaussian splatting |
✅ |
|
| 3 |
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields |
Proposes GOV-NeSF to address generalization in open-vocabulary 3D scene understanding |
implicit representation scene understanding open-vocabulary |
|
|
| 4 |
SGCNeRF: Few-Shot Neural Rendering via Sparse Geometric Consistency Guidance |
Proposes SGCNeRF for neural rendering from sparse views |
NeRF neural radiance field feature matching |
|
|
| 5 |
OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation |
Proposes OVFoodSeg for open-vocabulary food image segmentation |
open-vocabulary open vocabulary |
|
|
| 6 |
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts |
Evaluates the out-of-distribution (OOD) robustness challenges of open-vocabulary object detectors |
open-vocabulary open vocabulary |
✅ |
|
| 7 |
From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models |
Proposes an open-vocabulary scene graph generation framework to address the generation of visual relation concepts |
open-vocabulary open vocabulary |
|
|
| 8 |
CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians |
Proposes CityGaussian for real-time rendering of large-scale scenes |
3D gaussian splatting 3DGS gaussian splatting |
✅ |
|
| 9 |
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing |
Proposes Feature Splatting for dynamic scene synthesis and editing |
splatting foundation model |
|
|
| 10 |
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects |
Proposes a neural implicit representation for building digital twins of unknown articulated objects |
3D reconstruction implicit representation |
✅ |
|
| 11 |
HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior |
Proposes HAHA to generate animatable human avatars from monocular video |
gaussian splatting splatting SMPL |
|
|
| 12 |
360+x: A Panoptic Multi-modal Scene Understanding Dataset |
Introduces the 360+x dataset for multi-view, multi-modal scene understanding |
scene understanding egocentric |
|
|
| 13 |
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks |
Proposes the BadPart framework for black-box adversarial attacks on pixel-wise regression tasks |
depth estimation monocular depth optical flow |
|
|
| 14 |
LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization |
Proposes LoSA to address memory constraints in action localization for long videos |
optical flow foundation model |
|
|
| 15 |
Scalable Scene Modeling from Perspective Imaging: Physics-based Appearance and Geometry Inference |
Proposes a physics-based 3D scene modeling method to address the limitations of deep learning |
scene reconstruction |
|
|
| 16 |
StructLDM: Structured Latent Diffusion for 3D Human Generation |
Proposes StructLDM to address structured representation in 3D human generation |
NeRF |
✅ |
|
| 17 |
Improving Visual Recognition with Hyperbolical Visual Hierarchy Mapping |
Proposes Hi-Mapper to enhance hierarchical recognition of visual scenes |
scene understanding |
|
|