cs.RO(2025-10-18)
📊 共 4 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1)
支柱三:空间感知与语义 (Perception & Semantics) (1 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification | 提出运行时推理-行动对齐验证方法,提升VLA模型在机器人任务中的泛化性。 | vision-language-action VLA instruction following | ✅ | |
| 2 | Semi-Peaucellier Linkage and Differential Mechanism for Linear Pinching and Self-Adaptive Grasping | 提出SP-Diff平行夹爪系统,通过半反演连杆和差动机构实现线性夹持和自适应抓取。 | multimodal |
🔬 支柱三:空间感知与语义 (Perception & Semantics) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation | DIV-Nav:利用开放词汇空间关系进行多目标导航 | semantic mapping semantic map open-vocabulary | ✅ |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | MoS-VLA: A Vision-Language-Action Model with One-Shot Skill Adaptation | MoS-VLA:基于技能组合的VLA模型,实现机器人单样本技能迁移 | manipulation vision-language-action VLA |