Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion
Xingye Da, Zhaoming Xie, David Hoeller, Byron Boots, Animashree Anandkumar, Yuke Zhu, Buck Babich, Animesh Garg
TL;DR
This work addresses the challenge of enabling robust and energy-efficient legged locomotion under varying and novel environments. It introduces a hierarchical framework that couples a high-level reinforcement learning policy, which selects from a set of 9 primitives, with a low-level model-based controller that executes these primitives via quadratic programming and swing-foot control. Key contributions include sample-efficient RL training, zero-shot adaptation to novel scenarios, and direct sim-to-real transfer on a Laikago quadruped without randomization. The results show substantial energy savings, adaptive contact sequencing, and robust performance in split-belt and perturbation scenarios, highlighting the practical impact of integrating model-based control with learning for real-time locomotion.
Abstract
We present a hierarchical framework that combines model-based control and reinforcement learning (RL) to synthesize robust controllers for a quadruped (the Unitree Laikago). The system consists of a high-level controller that learns to choose from a set of primitives in response to changes in the environment and a low-level controller that utilizes an established control method to robustly execute the primitives. Our framework learns a controller that can adapt to challenging environmental changes on the fly, including novel scenarios not seen during training. The learned controller is up to 85~percent more energy efficient and is more robust compared to baseline methods. We also deploy the controller on a physical robot without any randomization or adaptation scheme.
