AnchorRoute: Human Motion Synthesis with Interval-Routed Sparse Contro

作者: Pengcheng Fang, Tengjiao Sun, Dongjie Fu, Xiaoyu Zhan, Yanwen Guo, Hansung Kim, Xiaohao Cai

分类: cs.GR, cs.CV, cs.LG

发布日期: 2026-05-14 (更新: 2026-05-15)

💡 一句话要点

AnchorRoute：利用区间路由稀疏控制的人体运动合成框架

🎯 匹配领域: 支柱四：生成式动作 (Generative Motion) 支柱七：动作重定向 (Motion Retargeting) 支柱八：物理动画 (Physics-based Animation) 支柱九：具身大模型 (Embodied Foundation Models)

关键词: 人体运动合成 稀疏控制 扩散模型 运动细化 锚点条件

📋 核心要点

现有方法在稀疏控制人体运动合成中，难以兼顾运动质量和对用户指定锚点的精确遵循。
AnchorRoute框架利用锚点作为生成和细化的共享支架，实现高质量和高精度的运动合成。
实验表明，AnchorRoute在稀疏关键点控制上优于现有方法，并显著提升了锚点依从性。

📝 摘要（中文）

AnchorRoute是一个稀疏锚点运动合成框架，它使用锚点作为生成和细化的共享支架。在生成之前，AnchorRoute将稀疏锚点转换为锚点条件特征，并通过AnchorKV和双上下文条件将生成的条件记忆注入到冻结的Transition Masked Diffusion先验中。这保留了预训练的文本到运动先验的生成质量，同时学习稀疏空间控制。生成后，相同的锚点被评估为残差：它们的时间戳定义了细化区间，它们的残差决定了应该集中校正的位置。然后，RouteSolver通过将软令牌更新投影到锚点定义的逐段仿射区间基上，来细化运动。这在单个锚点支架下，将生成时锚点条件与残差路由细化相结合。AnchorRoute支持根3D、平面根和身体点控制。在基准评估中，AnchorRoute在稀疏关键点协议下优于先前的稀疏控制方法，并始终提高跨控制族的锚点依从性。结果表明，学习到的锚点条件生成器和RouteSolver细化是互补的：生成器保留了文本运动质量，而RouteSolver提供了可控的路径，以实现更强的锚点依从性。

🔬 方法详解

问题定义：论文旨在解决人体运动合成中，如何利用稀疏锚点（如根位置、平面轨迹、身体关键点目标）实现对运动的精确控制，同时保持运动的自然性和真实感的问题。现有方法通常难以在运动质量和锚点依从性之间取得平衡，或者需要针对不同类型的锚点进行单独设计。

核心思路：AnchorRoute的核心思路是将锚点作为运动生成和细化的统一接口。首先，利用锚点信息引导运动生成过程，然后，基于锚点残差对生成的运动进行细化。这种双阶段方法能够充分利用预训练的运动先验知识，同时保证对用户指定锚点的精确遵循。

技术框架：AnchorRoute框架包含两个主要阶段：锚点条件运动生成和残差路由运动细化。在生成阶段，稀疏锚点被转换为锚点条件特征，并注入到预训练的Transition Masked Diffusion模型中，以引导运动生成。在细化阶段，计算生成运动与锚点之间的残差，并利用RouteSolver对运动进行局部调整，以减小残差。

关键创新：AnchorRoute的关键创新在于将锚点同时用于运动生成和细化，并设计了AnchorKV和RouteSolver模块来实现这一目标。AnchorKV模块用于将锚点条件特征注入到扩散模型中，RouteSolver模块则利用锚点残差对运动进行局部细化。这种统一的锚点处理方式简化了系统设计，并提高了运动控制的精度和灵活性。

关键设计：AnchorRoute使用Transition Masked Diffusion模型作为运动生成器，并采用AnchorKV模块将锚点条件特征注入到扩散模型的Transformer层中。RouteSolver模块使用逐段仿射函数作为基函数，将运动更新限制在锚点定义的时间区间内。损失函数包括运动重建损失、锚点依从性损失和运动平滑性损失。

🖼️ 关键图片

📊 实验亮点

实验结果表明，AnchorRoute在稀疏关键点控制任务上优于现有方法，例如在HumanML3D数据集上，AnchorRoute的FID指标显著优于其他基线方法。此外，AnchorRoute在不同类型的锚点控制任务中均表现出良好的性能，证明了其通用性和灵活性。通过消融实验，验证了AnchorKV和RouteSolver模块的有效性。

🎯 应用场景

AnchorRoute可应用于虚拟现实、游戏开发、动画制作等领域，允许用户通过少量关键帧或目标点来控制角色的运动，从而简化运动编辑流程，提高创作效率。该技术还可用于机器人控制，通过指定机器人的目标位置或轨迹，生成平滑自然的运动序列。

📄 摘要（原文）

Sparse anchors provide a compact interface for human motion authoring: users specify a few root positions, planar trajectory samples, or body-point targets, while the system synthesizes the full-body motion that completes the under-specified intent. We present AnchorRoute, a sparse-anchor motion synthesis framework that uses anchors as a shared scaffold for both generation and refinement. Before generation, AnchorRoute converts sparse anchors into anchor-condition features and injects the resulting condition memory into a frozen Transition Masked Diffusion prior through AnchorKV and dual-context conditioning. This preserves the generation quality of the pretrained text-to-motion prior while learning sparse spatial control. After generation, the same anchors are evaluated as residuals: their timestamps define refinement intervals, and their residuals determine where correction should be concentrated. RouteSolver then refines the motion by projecting soft-token updates onto anchor-defined piecewise-affine interval bases. This couples generation-time anchor conditioning with residual-routed refinement under one anchor scaffold. AnchorRoute supports root-3D, planar-root, and body-point control within the same formulation. In benchmark evaluations, AnchorRoute outperforms prior sparse-control methods under the sparse keyjoint protocol and consistently improves anchor adherence across control families. The results show that the learned anchor-conditioned generator and RouteSolver refinement are complementary: the generator preserves text-motion quality, while RouteSolver provides a controllable path toward stronger anchor adherence.

AnchorRoute: Human Motion Synthesis with Interval-Routed Sparse Contro

💡 一句话要点

📋 核心要点

📝 摘要（中文）

🔬 方法详解

🖼️ 关键图片

📊 实验亮点

🎯 应用场景

📄 摘要（原文）

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理