Gait-Adaptive Perceptive Humanoid Locomotion with Real-Time Under-Base Terrain Reconstruction
Haolin Song, Hongbo Zhu, Tao Yu, Yan Liu, Mingqi Yuan, Wengang Zhou, Hua Chen, Houqiang Li
TL;DR
The paper tackles the difficulty of robust humanoid locomotion on complex terrains by integrating exteroceptive terrain perception with gait timing and full-body control. It introduces a perceptive framework that uses a downward-looking depth camera to produce a dense under-base height map via a lightweight U-Net, feeding a unified policy that outputs both joint commands and a gait-frequency action. A single-stage Successive Teacher–Student (S-TS) training regime enables efficient knowledge transfer from privileged to partial observations, yielding a robust end-to-end policy. The approach is validated on a full-sized 31-DoF humanoid (Oli) in simulation and real-world tests, demonstrating omnidirectional walking, stair and gap traversal, and zero-shot generalization to unseen terrains. This work advances terrain-aware, gait-adaptive humanoid locomotion with practical sim-to-real transfer capabilities.
Abstract
For full-size humanoid robots, even with recent advances in reinforcement learning-based control, achieving reliable locomotion on complex terrains, such as long staircases, remains challenging. In such settings, limited perception, ambiguous terrain cues, and insufficient adaptation of gait timing can cause even a single misplaced or mistimed step to result in rapid loss of balance. We introduce a perceptive locomotion framework that merges terrain sensing, gait regulation, and whole-body control into a single reinforcement learning policy. A downward-facing depth camera mounted under the base observes the support region around the feet, and a compact U-Net reconstructs a dense egocentric height map from each frame in real time, operating at the same frequency as the control loop. The perceptual height map, together with proprioceptive observations, is processed by a unified policy that produces joint commands and a global stepping-phase signal, allowing gait timing and whole-body posture to be adapted jointly to the commanded motion and local terrain geometry. We further adopt a single-stage successive teacher-student training scheme for efficient policy learning and knowledge transfer. Experiments conducted on a 31-DoF, 1.65 m humanoid robot demonstrate robust locomotion in both simulation and real-world settings, including forward and backward stair ascent and descent, as well as crossing a 46 cm gap. Project Page:https://ga-phl.github.io/
