cs.CV(2025-01-29)
📊 共 13 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱三:空间感知与语义 (Perception & Semantics) (7 🔗1)
支柱九:具身大模型 (Embodied Foundation Models) (5 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (1)
🔬 支柱三:空间感知与语义 (Perception & Semantics) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | VoD-3DGS: View-opacity-Dependent 3D Gaussian Splatting | 提出视角-不透明度相关的3D高斯溅射,用于增强视角依赖渲染效果 | 3D gaussian splatting 3DGS gaussian splatting | ||
| 2 | FeatureGS: Eigenvalue-Feature Optimization in 3D Gaussian Splatting for Geometrically Accurate and Artifact-Reduced Reconstruction | FeatureGS:基于特征值优化的3D高斯溅射,实现几何精确和伪影减少的重建 | 3D gaussian splatting 3DGS gaussian splatting | ||
| 3 | CrowdSplat: Exploring Gaussian Splatting For Crowd Rendering | CrowdSplat:探索高斯溅射在人群渲染中的应用,实现高质量实时渲染。 | 3D gaussian splatting gaussian splatting splatting | ||
| 4 | Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation | ERR-Seg:通过减少冗余信息,高效解决开放词汇语义分割问题 | open-vocabulary open vocabulary | ||
| 5 | HOMER: Homography-Based Efficient Multi-view 3D Object Removal | HOMER:基于单应性的高效多视角3D物体移除方法 | 3D gaussian splatting gaussian splatting splatting | ||
| 6 | SSF: Sparse Long-Range Scene Flow for Autonomous Driving | 提出基于稀疏卷积的长程场景流方法SSF,提升自动驾驶感知能力。 | scene flow | ✅ | |
| 7 | 3D Reconstruction of Shoes for Augmented Reality | 提出基于3D高斯溅射的移动端鞋类3D重建与AR展示方案,提升在线购物体验 | 3D gaussian splatting gaussian splatting splatting |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 8 | Multimodal Large Language Models for Image, Text, and Speech Data Augmentation: A Survey | 综述:多模态大语言模型在图像、文本和语音数据增强中的应用 | large language model multimodal | ✅ | |
| 9 | LFTR: Learning-Free Token Reduction for Multimodal Large Language Models | 提出一种免训练的视觉Token缩减方法LFTR,用于加速多模态大语言模型推理。 | large language model multimodal | ||
| 10 | Robust Multimodal Learning via Cross-Modal Proxy Tokens | 提出跨模态代理令牌(CMPT),增强多模态模型在模态缺失情况下的鲁棒性。 | multimodal | ✅ | |
| 11 | Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models | 提出信心错位惩罚以改善低样本分类的领域泛化问题 | foundation model | ||
| 12 | General Scene Adaptation for Vision-and-Language Navigation | 提出GSA-VLN以解决视觉-语言导航中的环境适应问题 | VLN |
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | Glioma Multimodal MRI Analysis System for Tumor Layered Diagnosis via Multi-task Semi-supervised Learning | 提出GMMAS:基于多任务半监督学习的脑胶质瘤多模态MRI分层诊断系统 | contrastive learning distillation multimodal |