| 1 |
Fast-SAM3D: 3Dfy Anything in Images but Faster |
Fast-SAM3D:加速图像三维重建,提升推理效率且保持精度。 |
sam 3D SAM 3D spatiotemporal |
✅ |
|
| 2 |
VGGT-Motion: Motion-Aware Calibration-Free Monocular SLAM for Long-Range Consistency |
VGGT-Motion:面向长距离一致性的无标定单目SLAM系统 |
optical flow VGGT feature matching |
|
|
| 3 |
NeVStereo: A NeRF-Driven NVS-Stereo Architecture for High-Fidelity 3D Tasks |
NeVStereo:一种NeRF驱动的NVS-Stereo架构,用于高保真3D任务 |
depth estimation NeRF VGGT |
|
|
| 4 |
LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation |
LoGoSeg:融合局部与全局特征的开放词汇语义分割框架 |
open-vocabulary open vocabulary |
|
|
| 5 |
ShapeGaussian: High-Fidelity 4D Human Reconstruction in Monocular Videos via Vision Priors |
ShapeGaussian:利用视觉先验从单目视频中高保真重建4D人体 |
scene reconstruction SMPL human motion |
|
|
| 6 |
MTPano: Multi-Task Panoramic Scene Understanding via Label-Free Integration of Dense Prediction Priors |
MTPano:通过无标签密集预测先验集成实现多任务全景场景理解 |
scene understanding foundation model |
|
|
| 7 |
Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning |
提出CAMCUE框架,利用相机位姿进行多视角空间推理和视角预测。 |
scene understanding large language model multimodal |
|
|
| 8 |
NVS-HO: A Benchmark for Novel View Synthesis of Handheld Objects |
NVS-HO:首个手持物体新视角合成的RGB基准数据集 |
gaussian splatting splatting NeRF |
|
|
| 9 |
MerNav: A Highly Generalizable Memory-Execute-Review Framework for Zero-Shot Object Goal Navigation |
提出MerNav框架,解决零样本物体目标导航中泛化性与成功率难以兼顾的问题。 |
open-vocabulary open vocabulary VLN |
|
|
| 10 |
PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction |
提出PoseGaussian,利用姿态引导的高保真人体新视角合成框架 |
depth estimation gaussian splatting splatting |
|
|
| 11 |
IndustryShapes: An RGB-D Benchmark dataset for 6D object pose estimation of industrial assembly components and tools |
IndustryShapes:用于工业装配组件和工具6D位姿估计的RGB-D基准数据集 |
6D pose estimation |
✅ |
|
| 12 |
Feature points evaluation on omnidirectional vision with a photorealistic fisheye sequence -- A report on experiments done in 2014 |
全向视觉特征点评估:基于真实感鱼眼序列的实验报告(2014年) |
visual odometry |
|
|
| 13 |
Dual-Representation Image Compression at Ultra-Low Bitrates via Explicit Semantics and Implicit Textures |
提出双重表征图像压缩框架,融合显式语义和隐式纹理,提升超低码率下压缩性能。 |
implicit representation |
|
|