Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer
Xinyang Gu, Yen-Jen Wang, Jianyu Chen
TL;DR
Humanoid-Gym addresses the sizable sim-to-real transfer gap in humanoid locomotion by offering an open-source RL framework built on Nvidia Isaac Gym, with domain randomization and a sim-to-sim validation path to MuJoCo. The approach uses PPO with asymmetric actor-critic and privileged information, a high-frequency PD-based action interface, and a gait-aware reward design to encourage stable locomotion. It demonstrates zero-shot sim-to-real transfer on two RobotEra humanoids (XBot-S and XBot-L) and validates policies across flat and uneven terrains, aided by MuJoCo calibration that aligns simulated dynamics with real-world behavior. The work provides an accessible, reproducible pipeline and a sim-to-sim analysis tool to bolster robustness before real-world deployment.
Abstract
Humanoid-Gym is an easy-to-use reinforcement learning (RL) framework based on Nvidia Isaac Gym, designed to train locomotion skills for humanoid robots, emphasizing zero-shot transfer from simulation to the real-world environment. Humanoid-Gym also integrates a sim-to-sim framework from Isaac Gym to Mujoco that allows users to verify the trained policies in different physical simulations to ensure the robustness and generalization of the policies. This framework is verified by RobotEra's XBot-S (1.2-meter tall humanoid robot) and XBot-L (1.65-meter tall humanoid robot) in a real-world environment with zero-shot sim-to-real transfer. The project website and source code can be found at: https://sites.google.com/view/humanoid-gym/.
