Robot Parkour Learning
Ziwen Zhuang, Zipeng Fu, Jianren Wang, Christopher Atkeson, Soeren Schwertfeger, Chelsea Finn, Hang Zhao
TL;DR
<3-5 sentence high-level summary>We present an end-to-end, vision-based parkour system for low-cost quadrupeds that learns diverse skills (climb, leap, crawl, tilt, run) using a two-stage RL curriculum with soft dynamics constraints followed by hard dynamics constraints, then distills these into a single vision-based parkour policy via DAgger for onboard deployment. The method leverages privileged simulation information during training, depth-based vision, and automatic curricula to overcome difficult exploration, achieving robust sim-to-real transfer. Extensive simulation and real-world experiments on Unitree A1/Go1 demonstrate the robot autonomously selects and executes appropriate parkour skills in indoor and outdoor environments, with competitive success rates and high robustness. The work contributes an open-source framework, a principled two-stage learning approach, and strong empirical evidence that end-to-end vision-based parkour is feasible on affordable hardware.
Abstract
Parkour is a grand challenge for legged locomotion that requires robots to overcome various obstacles rapidly in complex environments. Existing methods can generate either diverse but blind locomotion skills or vision-based but specialized skills by using reference animal data or complex rewards. However, autonomous parkour requires robots to learn generalizable skills that are both vision-based and diverse to perceive and react to various scenarios. In this work, we propose a system for learning a single end-to-end vision-based parkour policy of diverse parkour skills using a simple reward without any reference motion data. We develop a reinforcement learning method inspired by direct collocation to generate parkour skills, including climbing over high obstacles, leaping over large gaps, crawling beneath low barriers, squeezing through thin slits, and running. We distill these skills into a single vision-based parkour policy and transfer it to a quadrupedal robot using its egocentric depth camera. We demonstrate that our system can empower two different low-cost robots to autonomously select and execute appropriate parkour skills to traverse challenging real-world environments.
