| 1 |
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding |
提出SeeGround以解决零样本开放词汇3D视觉定位问题 |
open-vocabulary open vocabulary visual grounding |
|
|
| 2 |
HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting |
HybridGS:利用2D和3D高斯溅射解耦瞬态和静态场景,实现高质量新视角合成。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 3 |
Towards Real-Time Open-Vocabulary Video Instance Segmentation |
提出TROY-VIS,加速开放词汇视频实例分割,实现实时性。 |
open-vocabulary open vocabulary foundation model |
✅ |
|
| 4 |
DGNS: Deformable Gaussian Splatting and Dynamic Neural Surface for Monocular Dynamic 3D Reconstruction |
DGNS:结合可变形高斯溅射与动态神经表面的单目动态3D重建 |
gaussian splatting splatting scene reconstruction |
|
|
| 5 |
PhysDepth: Plug-and-Play Physical Refinement for Monocular Depth Estimation in Challenging Environments |
PhysDepth:即插即用物理约束单目深度估计,提升恶劣环境性能 |
depth estimation monocular depth |
|
|
| 6 |
Monocular Dynamic Gaussian Splatting: Fast, Brittle, and Scene Complexity Rules |
单目动态高斯溅射:快速但脆弱,受场景复杂度制约 |
gaussian splatting splatting |
|
|
| 7 |
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation |
Mask-Adapter:通过优化Mask提升开放词汇分割性能 |
open-vocabulary open vocabulary |
✅ |
|
| 8 |
Grounding Descriptions in Images informs Zero-Shot Visual Recognition |
GRAIN:通过图像区域描述对齐,提升零样本视觉识别能力 |
open-vocabulary open vocabulary large language model |
✅ |
|
| 9 |
PBDyG: Position Based Dynamic Gaussians for Motion-Aware Clothed Human Avatars |
提出PBDyG,通过基于位置的动态高斯模型实现运动感知的服装人像重建 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 10 |
EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding |
EmbodiedOcc:提出基于视觉的在线场景理解的具身3D occupancy预测框架 |
splatting scene understanding |
✅ |
|
| 11 |
Deep Learning and Hybrid Approaches for Dynamic Scene Analysis, Object Detection and Motion Tracking |
提出一种基于深度学习和混合方法的动态场景分析与目标检测跟踪系统,优化视频监控。 |
optical flow motion tracking |
|
|
| 12 |
MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction |
MT3DNet:用于3D手术场景重建的多任务学习网络 |
depth estimation scene reconstruction |
|
|
| 13 |
Multi-View Pose-Agnostic Change Localization with Zero Labels |
提出一种无标签、视角无关的多视角变化定位方法,基于3D高斯溅射实现。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 14 |
QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos |
提出QUEEN框架以解决在线自由视角视频流传输问题 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 15 |
Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono Fail |
提出Stereo Anywhere,结合几何约束与单目深度先验,实现鲁棒的零样本立体匹配。 |
monocular depth foundation model |
|
|
| 16 |
Turbo3D: Ultra-fast Text-to-3D Generation |
Turbo3D:一种超快速的文本到3D高斯溅射生成系统,可在1秒内生成高质量资产。 |
gaussian splatting splatting |
|
|
| 17 |
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos |
MegaSaM:基于动态视频的快速、准确、鲁棒的结构与运动重建 |
visual SLAM depth estimation |
|
|
| 18 |
Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering |
提出基于自适应稀疏体素光栅化的实时高保真辐射场渲染方法 |
gaussian splatting splatting |
|
|