| 1 |
FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction |
FreeSplat++:面向高效室内场景重建的通用3D高斯溅射 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 2 |
NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations |
NeuralGS:结合神经场与3D高斯溅射,实现紧凑的3D表示 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 3 |
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments |
提出基于不确定性对齐的开放词汇语义分割方法,用于室内机器人场景理解 |
scene understanding open-vocabulary open vocabulary |
|
|
| 4 |
Evaluating Compositional Scene Understanding in Multimodal Generative Models |
评估多模态生成模型在组合场景理解中的能力,揭示其与人类的差距 |
scene understanding multimodal |
|
|
| 5 |
Empowering Large Language Models with 3D Situation Awareness |
提出基于情境感知的大语言模型3D场景理解方法,提升视角依赖任务性能。 |
scene understanding egocentric large language model |
|
|
| 6 |
CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction |
CityGS-X:一种高效且几何精确的大规模场景重建可扩展架构 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 7 |
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery |
评估DeepSeek模型在机器人辅助手术视觉语言理解中的推理能力 |
scene understanding large language model multimodal |
|
|