Learning Getting-Up Policies for Real-World Humanoid Robots
Xialin He, Runpei Dong, Zixuan Chen, Saurabh Gupta
TL;DR
The paper tackles humanoid fall recovery by learning two-stage getting-up and rolling-over policies with a curriculum and sim-to-real transfer. It introduces HumanUP, where a discovery stage finds motion trajectories under weak deployment constraints, followed by a deployable stage that imitates these trajectories under strong regularization and domain randomization to ensure real-world reliability. Real-world experiments on a Unitree G1 demonstrate higher success rates and smoother, safer motions than hand-designed controllers, validating the approach across supine and prone poses on varied terrains. The work highlights the importance of curriculum design, full collision modeling, posture randomization, and soft symmetry for achieving deployable, generalizable policies in contact-rich tasks. This framework advances practical autonomous fall recovery for human-sized humanoids and suggests broader applicability to other complex contact-driven behaviors.
Abstract
Automatic fall recovery is a crucial prerequisite before humanoid robots can be reliably deployed. Hand-designing controllers for getting up is difficult because of the varied configurations a humanoid can end up in after a fall and the challenging terrains humanoid robots are expected to operate on. This paper develops a learning framework to produce controllers that enable humanoid robots to get up from varying configurations on varying terrains. Unlike previous successful applications of learning to humanoid locomotion, the getting-up task involves complex contact patterns (which necessitates accurately modeling of the collision geometry) and sparser rewards. We address these challenges through a two-phase approach that induces a curriculum. The first stage focuses on discovering a good getting-up trajectory under minimal constraints on smoothness or speed / torque limits. The second stage then refines the discovered motions into deployable (i.e. smooth and slow) motions that are robust to variations in initial configuration and terrains. We find these innovations enable a real-world G1 humanoid robot to get up from two main situations that we considered: a) lying face up and b) lying face down, both tested on flat, deformable, slippery surfaces and slopes (e.g., sloppy grass and snowfield). This is one of the first successful demonstrations of learned getting-up policies for human-sized humanoid robots in the real world.
