GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots
Gilbert Feng, Hongbo Zhang, Zhongyu Li, Xue Bin Peng, Bhuvan Basireddy, Linzhu Yue, Zhitao Song, Lizhi Yang, Yunhui Liu, Koushil Sreenath, Sergey Levine
TL;DR
GenLoco tackles the challenge of creating broadly applicable locomotion controllers for quadrupedal robots by training a single phase- and history-conditioned policy on a wide range of procedurally generated morphologies. The method randomizes morphology and dynamics in simulation and uses a simple feedforward policy that outputs target joint positions $\mathbf{q}^d \in \mathbb{R}^{12}$, which are added to time-invariant nominal poses and filtered before PD control, all learned with PPO at $30$ Hz. Key contributions include the GenLoco framework, a straightforward morphology randomization scheme, and extensive zero-shot real-world and out-of-distribution evaluations demonstrating transfer to unseen robots (e.g., A1, Mini Cheetah, Sirius) without robot-specific retraining. The work significantly reduces manual controller engineering for new quadrupedal platforms and suggests a path toward general-purpose robotic locomotion controllers, while noting limitations such as fixed DoFs and the potential benefits of more expressive architectures for broader generalization.
Abstract
Recent years have seen a surge in commercially-available and affordable quadrupedal robots, with many of these platforms being actively used in research and industry. As the availability of legged robots grows, so does the need for controllers that enable these robots to perform useful skills. However, most learning-based frameworks for controller development focus on training robot-specific controllers, a process that needs to be repeated for every new robot. In this work, we introduce a framework for training generalized locomotion (GenLoco) controllers for quadrupedal robots. Our framework synthesizes general-purpose locomotion controllers that can be deployed on a large variety of quadrupedal robots with similar morphologies. We present a simple but effective morphology randomization method that procedurally generates a diverse set of simulated robots for training. We show that by training a controller on this large set of simulated robots, our models acquire more general control strategies that can be directly transferred to novel simulated and real-world robots with diverse morphologies, which were not observed during training.
