cs.CV(2025-05-09)
📊 共 6 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1)
支柱三:空间感知与语义 (Perception & Semantics) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Temperature-Driven Robust Disease Detection in Brain and Gastrointestinal Disorders via Context-Aware Adaptive Knowledge Distillation | 提出基于上下文感知自适应知识蒸馏的稳健疾病检测方法,提升脑部和胃肠道疾病诊断精度。 | teacher-student distillation | ||
| 2 | Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation | Topo-VM-UNetV2:将拓扑信息编码进Vision Mamba UNet用于息肉分割 | Mamba state space model | ||
| 3 | VIN-NBV: A View Introspection Network for Next-Best-View Selection | 提出VIN-NBV,通过视角自省网络优化三维重建的下一最佳视角选择 | reinforcement learning deep reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Adapting a Segmentation Foundation Model for Medical Image Classification | 提出一种基于SAM的医学图像分类框架,利用空间局部通道注意力提升性能 | foundation model | ||
| 5 | MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | 提出MM-Skin数据集和SkinVL模型,提升皮肤科视觉-语言模型在皮肤疾病诊断分析中的性能。 | multimodal instruction following | ✅ |
🔬 支柱三:空间感知与语义 (Perception & Semantics) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | 提出基于纯视觉的鸟瞰图感知框架,用于低成本自动驾驶环境建模。 | depth estimation monocular depth Depth Anything |