Whole-body Humanoid Robot Locomotion with Human Reference
Qiang Zhang, Peter Cui, David Yan, Jingkai Sun, Yiqun Duan, Gang Han, Wen Zhao, Weining Zhang, Yijie Guo, Arthur Zhang, Renjing Xu
TL;DR
This work introduces Adam, a full-size humanoid robot, and a novel imitation-learning framework using adversarial motion priors guided by human motion data to overcome reward-design and sim-to-real challenges in humanoid locomotion. The approach combines AMP-based adversarial learning with end-to-end PPO reinforcement learning, employing world-coordinate rewards, detailed periodic gait incentives, and extensive domain randomization to achieve robust, human-like locomotion across unseen terrains. Experimental validation spans Webots, Isaac Gym, and real hardware, demonstrating cross-platform consistency and real-world viability, including previously unseen capabilities like heel-to-toe running and straight-knee arm swing. The study advances practical humanoid locomotion by reducing manual reward engineering and enabling scalable, perception-augmented future enhancements for autonomous mobility.
Abstract
Recently, humanoid robots have made significant advances in their ability to perform challenging tasks due to the deployment of Reinforcement Learning (RL), however, the inherent complexity of humanoid robots, including the difficulty of designing complicated reward functions and training entire sophisticated systems, still poses a notable challenge. To conquer these challenges, after many iterations and in-depth investigations, we have meticulously developed a full-size humanoid robot, "Adam", whose innovative structural design greatly improves the efficiency and effectiveness of the imitation learning process. In addition, we have developed a novel imitation learning framework based on an adversarial motion prior, which applies not only to Adam but also to humanoid robots in general. Using the framework, Adam can exhibit unprecedented human-like characteristics in locomotion tasks. Our experimental results demonstrate that the proposed framework enables Adam to achieve human-comparable performance in complex locomotion tasks, marking the first time that human locomotion data has been used for imitation learning in a full-size humanoid robot.
