cs.CV(2025-07-22)
📊 共 4 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (2)
支柱一:机器人控制 (Robot Control) (1)
支柱三:空间感知与语义 (Perception & Semantics) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning | C2-Evo:协同进化多模态数据与模型,实现自我提升的推理能力 | reinforcement learning large language model multimodal | ||
| 2 | Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning | 提出Zebra-CoT数据集,用于提升视觉语言模型在复杂推理任务中的表现 | reinforcement learning multimodal chain-of-thought |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning | 提出ThinkAct框架,通过强化视觉潜在规划实现视觉-语言-动作推理 | manipulation embodied AI vision-language-action |
🔬 支柱三:空间感知与语义 (Perception & Semantics) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension | 提出ReMeREC框架,解决多实体指代表达理解中关系建模不足的问题。 | scene understanding large language model |