cs.RO（2026-03-18）

📊 共 20 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱一：机器人控制 (Robot Control) (8 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (5 🔗2) 支柱九：具身大模型 (Embodied Foundation Models) (4) 支柱二：RL算法与架构 (RL & Architecture) (2 🔗1) 支柱六：视频提取与匹配 (Video Extraction) (1)

🔬 支柱一：机器人控制 (Robot Control) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
1	RoboForge: Physically Optimized Text-guided Whole-Body Locomotion for Humanoids	RoboForge：面向人形机器人的物理优化文本引导全身运动框架	humanoid humanoid robot humanoid locomotion
2	ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models	ProbeFlow：面向VLA模型的免训练自适应Flow Matching加速框架	manipulation flow matching vision-language-action
3	DexViTac: Collecting Human Visuo-Tactile-Kinematic Demonstrations for Contact-Rich Dexterous Manipulation	DexViTac：用于灵巧操作的视觉-触觉-运动学多模态数据采集系统	manipulation dexterous manipulation representation learning	✅
4	KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition	KineVLA：提出一种双层动作分解的运动学感知视觉-语言-动作模型	manipulation vision-language-action VLA
5	VolumeDP: Modeling Volumetric Representation for Manipulation Policy Learning	VolumeDP：通过建模体积表示提升操作策略学习性能	manipulation policy learning imitation learning
6	REAL: Robust Extreme Agility via Spatio-Temporal Policy Learning and Physics-Guided Filtering	REAL：基于时空策略学习和物理引导滤波的鲁棒极限敏捷控制	quadruped parkour Unitree
7	EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards	EVA：通过逆动力学奖励对齐视频世界模型与可执行机器人动作，解决执行鸿沟问题。	bi-manual reinforcement learning world model
8	Full Stack Navigation, Mapping, and Planning for the Lunar Autonomy Challenge	为月球自主挑战赛设计全栈导航、建图与规划系统，荣获第一名	motion planning visual odometry

🔬 支柱三：空间感知与语义 (Perception & Semantics) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
9	SafeLand: Safe Autonomous Landing in Unknown Environments with Bayesian Semantic Mapping	SafeLand：基于贝叶斯语义地图的未知环境安全自主着陆系统	semantic mapping semantic map	✅
10	AgentVLN: Towards Agentic Vision-and-Language Navigation	提出AgentVLN，解决视觉语言导航中空间感知和表征不一致问题	metric depth VLN chain-of-thought	✅
11	P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation	提出P$^{3}$Nav，通过感知、预测和规划一体化框架提升视觉语言导航性能。	scene understanding semantic map VLN
12	Visual SLAM with DEM Anchoring for Lunar Surface Navigation	提出DEM约束的视觉SLAM，用于月球表面长距离导航	visual SLAM
13	Neural Radiance Maps for Extraterrestrial Navigation and Path Planning	提出基于NeRF的星外导航方法，解决行星探测车全局地图构建与路径规划问题	NeRF neural radiance field

🔬 支柱九：具身大模型 (Embodied Foundation Models) (4 篇)

#	题目	一句话要点	标签	🔗	⭐
14	HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness	HeiSD：基于运动学感知的具身视觉-语言-动作模型混合推测解码加速框架	vision-language-action VLA
15	Specification-Aware Distribution Shaping for Robotics Foundation Models	提出一种规范感知的分布调整框架，增强机器人基础模型对时序逻辑约束的满足能力	foundation model
16	FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation	提出FloorPlan-VLN，利用语义平面图引导视觉-语言导航，提升空间推理能力。	VLN
17	AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation	提出AERR-Nav以解决多层环境中的零-shot物体导航问题	large language model multimodal

🔬 支柱二：RL算法与架构 (RL & Architecture) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
18	Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control	提出GeCO：一种时间无条件Flow Matching框架，用于自适应和鲁棒的机器人控制。	imitation learning flow matching vision-language-action	✅
19	Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation	提出一种融合VLM和LLM的多目标强化学习机器人导航方法，实现上下文感知的人类偏好理解。	reinforcement learning large language model

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
20	OmniVLN: Omnidirectional 3D Perception and Token-Efficient LLM Reasoning for Visual-Language Navigation across Air and Ground Platforms	OmniVLN：面向空地平台视觉语言导航的3D全向感知与高效LLM推理	egocentric multimodal

⬅️ 返回 cs.RO 首页 · 🏠 返回主页