Think on your feet: Seamless Transition between Human-like Locomotion in Response to Changing Commands
Huaxing Huang, Wenhao Cui, Tonghe Zhang, Shengtao Li, Jinchao Han, Bangyu Qin, Tianchu Zhang, Liang Zheng, Ziyang Tang, Chenxu Hu, Ning Yan, Jiahao Chen, Shipu Zhang, Zheyuan Jiang
TL;DR
This work addresses the problem of enabling seamless transitions between human-like locomotion in humanoid robots under changing velocity commands. It formulates locomotion control as a partially observable MDP and integrates a Hybrid Internal Model for velocity/state estimation, a Wasserstein-divergence discriminator to prevent mode collapse in imitation learning, and a curiosity bonus to encourage exploration, with motion retargeting from MoCap and domain randomization to bridge sim-to-real gaps. The approach yields a novel architecture that generalizes to unseen intermediate motions and transfers zero-shot from simulation to real hardware, validated across multiple terrains and robot platforms. The results demonstrate improved velocity tracking, more natural human-like gaits, and robust multitasking, highlighting practical potential for versatile humanoid locomotion in human-centric environments.
Abstract
While it is relatively easier to train humanoid robots to mimic specific locomotion skills, it is more challenging to learn from various motions and adhere to continuously changing commands. These robots must accurately track motion instructions, seamlessly transition between a variety of movements, and master intermediate motions not present in their reference data. In this work, we propose a novel approach that integrates human-like motion transfer with precise velocity tracking by a series of improvements to classical imitation learning. To enhance generalization, we employ the Wasserstein divergence criterion (WGAN-div). Furthermore, a Hybrid Internal Model provides structured estimates of hidden states and velocity to enhance mobile stability and environment adaptability, while a curiosity bonus fosters exploration. Our comprehensive method promises highly human-like locomotion that adapts to varying velocity requirements, direct generalization to unseen motions and multitasking, as well as zero-shot transfer to the simulator and the real world across different terrains. These advancements are validated through simulations across various robot models and extensive real-world experiments.
