| 1 |
Moment-Based 3D Gaussian Splatting: Resolving Volumetric Occlusion with Order-Independent Transmittance |
提出基于矩的3D高斯溅射,通过与顺序无关的透射率解决体积遮挡问题。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 2 |
Prior-Enhanced Gaussian Splatting for Dynamic Scene Reconstruction from Casual Video |
提出先验增强的高斯溅射方法,用于从随手拍摄的视频中重建动态场景。 |
gaussian splatting splatting scene reconstruction |
|
|
| 3 |
Lightweight 3D Gaussian Splatting Compression via Video Codec |
提出基于视频编解码器的轻量级3D高斯溅射压缩方法,提升低比特率下的压缩性能。 |
3D gaussian splatting gaussian splatting splatting |
✅ |
|
| 4 |
VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing |
提出VLM2GeoVec,用于遥感领域通用多模态嵌入,统一检索与区域理解。 |
scene understanding multimodal instruction following |
|
|
| 5 |
Semantic-Drive: Democratizing Long-Tail Data Curation via Open-Vocabulary Grounding and Neuro-Symbolic VLM Consensus |
Semantic-Drive:通过开放词汇 grounding 和神经符号 VLM 共识实现长尾数据挖掘 |
open-vocabulary open vocabulary symbolic grounding |
|
|
| 6 |
MultiEgo: A Multi-View Egocentric Video Dataset for 4D Scene Reconstruction |
提出MultiEgo数据集,用于多视角以自我中心视频的4D场景重建 |
scene reconstruction egocentric |
|
|
| 7 |
Depth-Copy-Paste: Multimodal and Depth-Aware Compositing for Robust Face Detection |
提出Depth-Copy-Paste,通过多模态深度感知合成增强人脸检测鲁棒性。 |
Depth Anything multimodal |
|
|
| 8 |
Evaluating Foundation Models' 3D Understanding Through Multi-View Correspondence Analysis |
提出基于多视角对应分析的3D理解能力评估基准,无需微调。 |
scene understanding foundation model |
✅ |
|
| 9 |
On Geometric Understanding and Learned Data Priors in VGGT |
分析VGGT几何理解能力:揭示其隐式几何建模与数据先验依赖 |
VGGT foundation model |
|
|
| 10 |
Reconstruction as a Bridge for Event-Based Visual Question Answering |
提出基于重建桥梁的事件相机视觉问答方法,并构建EvQA基准 |
scene understanding large language model multimodal |
|
|
| 11 |
Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation |
提出SAM2VideoX,通过结构保持的运动先验提升视频生成质量 |
optical flow motion representation |
|
|
| 12 |
Super-Resolved Canopy Height Mapping from Sentinel-2 Time Series Using LiDAR HD Reference Data across Metropolitan France |
提出THREASURE-Net,利用Sentinel-2时间序列和LiDAR数据进行高分辨率森林冠层高度制图。 |
height map |
✅ |
|