cs.CV (2025-07-22)

📊 4 papers in total

🎯 Interest Area Navigation

Pillar 2: RL Algorithms & Architecture (2) · Pillar 1: Robot Control (1) · Pillar 3: Perception & Semantics (1)

🔬 Pillar 2: RL Algorithms & Architecture (2 papers)

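| # | Title | Key point | Tags |
|---|-------|-----------|------|
| 1 | C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning | C2-Evo co-evolves multimodal data and the model to achieve self-improving reasoning. | reinforcement learning, large language model, multimodal |
| 2 | Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning | Proposes the Zebra-CoT dataset to improve vision-language models on complex reasoning tasks. | reinforcement learning, multimodal, chain-of-thought |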

🔬 Pillar 1: Robot Control (1 paper)

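| # | Title | Key point | Tags |
|---|-------|-----------|------|
| 3 | ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning | Proposes the ThinkAct framework, which achieves vision-language-action reasoning through reinforced visual latent planning. | manipulation, embodied AI, vision-language-action |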

🔬 Pillar 3: Perception & Semantics (1 paper)

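| # | Title | Key point | Tags |
|---|-------|-----------|------|
| 4 | ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension | Proposes the ReMeREC framework to address insufficient relation modeling in multi-entity referring expression comprehension. | scene understanding, large language model |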
