Natural Humanoid Robot Locomotion with Generative Motion Prior
Haodong Zhang, Liang Zhang, Zhenghan Chen, Lu Chen, Yue Wang, Rong Xiong
TL;DR
This work tackles natural, human-like locomotion for humanoid robots by introducing a Generative Motion Prior (GMP) that offline-trains a CVAE to predict future, human-like whole-body motions based on current pose and velocity commands. During RL, GMP serves as a frozen online motion generator, supplying dense trajectory-level supervision (joint angles and keypoint positions) to guide policy learning with rewards that combine motion guidance, task performance, and stability. The approach relies on whole-body motion retargeting to create realistic robot reference motions, and uses a command-conditioned latent encoder to align generated trajectories with user commands. Experimental results in simulation and on a real NAVIAI humanoid demonstrate improved motion naturalness and stable training compared to baselines, indicating GMP's effectiveness for sim-to-real humanoid locomotion. The method offers a scalable, interpretable path to integrating human-like motion into robot locomotion through the combination of generative priors and reinforcement learning.
Abstract
Natural and lifelike locomotion remains a fundamental challenge for humanoid robots to interact with human society. However, previous methods either neglect motion naturalness or rely on unstable and ambiguous style rewards. In this paper, we propose a novel Generative Motion Prior (GMP) that provides fine-grained motion-level supervision for the task of natural humanoid robot locomotion. To leverage natural human motions, we first employ whole-body motion retargeting to effectively transfer them to the robot. Subsequently, we train a generative model offline to predict future natural reference motions for the robot based on a conditional variational auto-encoder. During policy training, the generative motion prior serves as a frozen online motion generator, delivering precise and comprehensive supervision at the trajectory level, including joint angles and keypoint positions. The generative motion prior significantly enhances training stability and improves interpretability by offering detailed and dense guidance throughout the learning process. Experimental results in both simulation and real-world environments demonstrate that our method achieves superior motion naturalness compared to existing approaches. Project page can be found at https://sites.google.com/view/humanoid-gmp
