Learning Bipedal Locomotion on Gear-Driven Humanoid Robot Using Foot-Mounted IMUs
Sotaro Katayama, Yuta Koda, Norio Nagatsuka, Masaya Kinoshita
TL;DR
This work tackles the sim-to-real gap in reinforcement learning for gear-driven humanoid robots lacking torque sensors by leveraging foot-mounted IMUs to rapidly stabilize locomotion across diverse terrains. It introduces symmetric data augmentation and random network distillation within a Legged Gym-based RL framework and employs a teacher–student training pipeline with fine-tuning to transfer policies to hardware. Hardware experiments on the EVAL-03 miniature humanoid demonstrate improved stability on non-rigid surfaces and during abrupt terrain transitions, though upward-step navigation remains a challenge. The approach offers a practical path toward robust, torque-free bipedal control for low-cost humanoids, with future work pointing to integrating additional sensing or lower-gain joint control to handle more complex terrains.
Abstract
Sim-to-real reinforcement learning (RL) for humanoid robots with high-gear ratio actuators remains challenging due to complex actuator dynamics and the absence of torque sensors. To address this, we propose a novel RL framework leveraging foot-mounted inertial measurement units (IMUs). Instead of pursuing detailed actuator modeling and system identification, we utilize foot-mounted IMU measurements to enhance rapid stabilization capabilities over challenging terrains. Additionally, we propose symmetric data augmentation dedicated to the proposed observation space and random network distillation to enhance bipedal locomotion learning over rough terrain. We validate our approach through hardware experiments on a miniature-sized humanoid EVAL-03 over a variety of environments. The experimental results demonstrate that our method improves rapid stabilization capabilities over non-rigid surfaces and sudden environmental transitions.
