cs.CV(2025-05-10)

📊 共 11 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱三:空间感知与语义 (Perception & Semantics) (4) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗2) 支柱六:视频提取与匹配 (Video Extraction) (1) 支柱二:RL算法与架构 (RL & Architecture) (1) 支柱四:生成式动作 (Generative Motion) (1) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱三:空间感知与语义 (Perception & Semantics) (4 篇)

#题目一句话要点标签🔗
1 METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection 提出METOR框架以解决开放词汇视频视觉关系检测问题 open-vocabulary open vocabulary
2 Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation 提出因果提示校准方法以解决开放词汇多实体分割问题 open-vocabulary open vocabulary
3 Edge-Enabled VIO with Long-Tracked Features for High-Accuracy Low-Altitude IoT Navigation 提出长跟踪特征的边缘启用VIO以解决低空IoT导航中的定位漂移问题 VIO
4 ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors 提出ElectricSight以解决电力线路3D危险监测问题 depth estimation monocular depth

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
5 Batch Augmentation with Unimodal Fine-tuning for Multimodal Learning 提出批量增强与单模态微调以检测胎儿器官 large language model multimodal
6 TACFN: Transformer-based Adaptive Cross-modal Fusion Network for Multimodal Emotion Recognition 提出TACFN以解决多模态情感识别中的特征冗余问题 multimodal
7 Improving Generalization of Medical Image Registration Foundation Model 提出Sharpness-Aware Minimization以增强医学图像配准基础模型的泛化能力 foundation model

🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)

#题目一句话要点标签🔗
8 GRACE: Estimating Geometry-level 3D Human-Scene Contact from 2D Images 提出GRACE以解决3D人类-场景接触估计问题 SMPL embodied AI

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
9 Dataset Distillation with Probabilistic Latent Features 提出基于概率潜在特征的数据集蒸馏方法以降低计算成本 distillation

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
10 HDGlyph: A Hierarchical Disentangled Glyph-Based Framework for Long-Tail Text Rendering in Diffusion Models 提出HDGlyph框架以解决长尾文本渲染问题 classifier-free guidance

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
11 ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images 提出ProFashion以解决时尚视频生成中的视角一致性问题 spatiotemporal

⬅️ 返回 cs.CV 首页 · 🏠 返回主页