| 1 |
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection |
提出GLIS框架,利用全局-局部协作推理和LLM提升LiDAR开放词汇检测性能。 |
open-vocabulary open vocabulary large language model |
✅ |
|
| 2 |
StyleSplat: 3D Object Style Transfer with Gaussian Splatting |
StyleSplat:基于高斯溅射的3D物体风格迁移方法 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 3 |
DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training |
DART:自动化端到端目标检测流水线,解决标注难题并提升检测精度。 |
open-vocabulary open vocabulary multimodal |
✅ |
|
| 4 |
Open Vocabulary Multi-Label Video Classification |
提出基于LLM语义引导的开放词汇多标签视频分类方法,提升视频理解能力。 |
open-vocabulary open vocabulary large language model |
|
|
| 5 |
ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion |
ProDepth:利用概率融合提升自监督多帧单目深度估计 |
depth estimation monocular depth feature matching |
|
|
| 6 |
KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting |
KGpose:基于关键点图和逐点姿态投票的多目标6D姿态端到端估计 |
6D pose estimation |
|
|
| 7 |
Physics-Informed Learning of Characteristic Trajectories for Smoke Reconstruction |
提出神经特征轨迹场,用于烟雾重建中长期物理约束建模 |
NeRF scene reconstruction |
✅ |
|
| 8 |
Radiance Fields from Photons |
提出基于单光子相机(SPC)的Quanta NeRF,解决低光、高动态范围和高速运动下的NeRF重建问题。 |
NeRF neural radiance field |
|
|
| 9 |
HPC: Hierarchical Progressive Coding Framework for Volumetric Video |
提出HPC框架,以单模型实现神经辐射场体积视频的灵活可变码率压缩。 |
NeRF neural radiance field |
|
|
| 10 |
Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems |
提出基于隐式表达的电磁逆散射方法,用于非侵入式内部成像 |
implicit representation |
✅ |
|