cs.CV(2025-01-20)
📊 共 12 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (7 🔗2)
支柱三:空间感知与语义 (Perception & Semantics) (3)
支柱二:RL算法与架构 (RL & Architecture) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)
🔬 支柱三:空间感知与语义 (Perception & Semantics) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization | 提出局部深度和语义正则化的稀疏视角3D高斯溅射方法,提升渲染质量。 | 3D gaussian splatting 3DGS gaussian splatting | ||
| 9 | Dynamic Scene Understanding from Vision-Language Representations | 利用视觉-语言表征进行动态场景理解,无需大量任务特定工程。 | scene understanding human-object interaction | ||
| 10 | Event-based vision for egomotion estimation using precise event timing | 提出基于精确事件时间信息的事件相机运动估计方法,适用于低功耗机器人应用。 | optical flow motion tracking |
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | EndoChat:用于内窥镜手术的具身多模态大型语言模型 | representation learning scene understanding large language model | ||
| 12 | DEFEND: A Large-scale 1M Dataset and Foundation Model for Tobacco Addiction Prevention | 提出 Tobacco-1M 数据集与 DEFEND 烟草成瘾预防基础模型,提升烟草产品监管能力。 | representation learning foundation model multimodal |