DynaRetarget: Dynamically-Feasible Retargeting using Sampling-Based Trajectory Optimization
Victor Dhedin, Ilyass Taouil, Shafeef Omar, Dian Yu, Kun Tao, Angela Dai, Majid Khadiv
TL;DR
DynaRetarget tackles the challenge of converting human demonstrations into dynamically feasible humanoid loco-manipulation trajectories by combining IK-based retargeting with Sampling-Based Trajectory Optimization (SBTO) that incrementally grows the optimization horizon. This long-horizon refinement addresses the myopic and brittle behavior of prior SBMPC-based retargeting approaches, producing smoother, physically consistent trajectories that generalize across varied object properties. The refined trajectories are then used to train RL tracking policies with domain randomization, achieving robust zero-shot transfer to real humanoid hardware and enabling scalable synthetic data generation for loco-manipulation. The results show higher retargeting success rates than state-of-the-art baselines and clear benefits for downstream RL learning, highlighting the method’s practical impact for real-world robotics and data-driven policy development.
Abstract
In this paper, we introduce DynaRetarget, a complete pipeline for retargeting human motions to humanoid control policies. The core component of DynaRetarget is a novel Sampling-Based Trajectory Optimization (SBTO) framework that refines imperfect kinematic trajectories into dynamically feasible motions. SBTO incrementally advances the optimization horizon, enabling optimization over the entire trajectory for long-horizon tasks. We validate DynaRetarget by successfully retargeting hundreds of humanoid-object demonstrations and achieving higher success rates than the state of the art. The framework also generalizes across varying object properties, such as mass, size, and geometry, using the same tracking objective. This ability to robustly retarget diverse demonstrations opens the door to generating large-scale synthetic datasets of humanoid loco-manipulation trajectories, addressing a major bottleneck in real-world data collection.
