cs.RO（2025-11-21）

📊 共 21 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱一：机器人控制 (Robot Control) (9 🔗2) 支柱三：空间感知与语义 (Perception & Semantics) (4 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (4) 支柱二：RL算法与架构 (RL & Architecture) (2) 支柱六：视频提取与匹配 (Video Extraction) (1) 支柱三：空间感知 (Perception & SLAM) (1)

🔬 支柱一：机器人控制 (Robot Control) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network	提出基于频率的步态生成网络，结合PPO控制，实现类人双足稳健行走	bipedal biped locomotion
2	Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data	AMS：融合异构数据，实现敏捷与稳定兼备的通用人形机器人控制	humanoid humanoid robot humanoid control
3	SM2ITH: Safe Mobile Manipulation with Interactive Human Prediction via Task-Hierarchical Bilevel Model Predictive Control	SM²ITH：基于任务分层双层MPC的安全移动操作与交互式人类行为预测	manipulation mobile manipulation model predictive control
4	METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model	METIS：多源自中心训练的集成灵巧视觉-语言-动作模型	manipulation dexterous manipulation teleoperation
5	RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation	RoboCOIN：开源多形态双臂机器人操作数据集，促进集成操作学习	manipulation bi-manual bimanual manipulation	✅
6	Learning Diffusion Policies for Robotic Manipulation of Timber Joinery under Fabrication Uncertainty	提出基于扩散策略学习的木材榫卯结构机器人操作方法，解决装配不确定性问题	manipulation policy learning diffusion policy
7	See, Plan, Cut: MPC-Based Autonomous Volumetric Robotic Laser Surgery with OCT Guidance	提出RATS系统以解决现有机器人激光手术缺乏体积规划与反馈的问题	MPC model predictive control
8	H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation	提出H-GAR框架，通过目标驱动的观察-动作细化实现更精确的机器人操作	manipulation
9	Vector Cost Behavioral Planning for Autonomous Robotic Systems with Contemporary Validation Strategies	提出基于向量代价双矩阵博弈的机器人行为规划方法，提升多目标决策性能。	motion planning	✅

🔬 支柱三：空间感知与语义 (Perception & Semantics) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
10	HALO: High-Altitude Language-Conditioned Monocular Aerial Exploration and Navigation	HALO：基于单目视觉和语言条件的高空无人机探索与导航	semantic mapping semantic map language conditioned	✅
11	Robot Confirmation Generation and Action Planning Using Long-context Q-Former Integrated with Multimodal LLM	提出基于长上下文Q-Former与多模态LLM的机器人动作确认与规划方法	scene understanding multimodal
12	TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making	提出TP-MDDN基准与AWMSystem系统，解决具身AI中多需求任务偏好导航问题	semantic mapping semantic map embodied AI
13	Vision-Guided Optic Flow Navigation for Small Lunar Missions	针对小型月球任务，提出一种基于视觉引导光流导航的轻量级自主导航方案	depth estimation optical flow

🔬 支柱九：具身大模型 (Embodied Foundation Models) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
14	Efficient Robot Design with Multi-Objective Black-Box Optimization and Large Language Models	提出基于LLM辅助黑盒优化的机器人高效设计方法	large language model
15	SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding	SPEAR-1：通过3D理解扩展机器人演示学习能力	foundation model
16	FORWARD: Dataset of a forwarder operating in rough terrain	发布FORWARD数据集，用于林业机械在复杂地形下的感知、控制与仿真研究。	multimodal
17	Progress-Think: Semantic Progress Reasoning for Vision-Language Navigation	Progress-Think提出语义进度推理，提升视觉-语言导航任务的性能和效率。	vision-language-action

🔬 支柱二：RL算法与架构 (RL & Architecture) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
18	RynnVLA-002: A Unified Vision-Language-Action and World Model	RynnVLA-002：提出统一的视觉-语言-动作和世界模型，提升机器人任务性能。	world model vision-language-action VLA
19	Stable Offline Hand-Eye Calibration for any Robot with Just One Mark	提出CalibAll，仅用单标记实现任意机器人离线手眼标定的稳定方法	imitation learning foundation model

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
20	IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation	提出IndustryNav：动态工业导航场景下具身智能体的空间推理基准	egocentric egocentric vision large language model

🔬 支柱三：空间感知 (Perception & SLAM) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
21	Single-Pixel Tactile Skin via Compressive Sampling	提出基于压缩感知的单像素触觉皮肤，解决大面积触觉传感的数据瓶颈问题	localization

⬅️ 返回 cs.RO 首页 · 🏠 返回主页