Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning
Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath
TL;DR
This work tackles the problem of enabling a quadrupedal robot to function as a soccer goalkeeper by integrating multiple dynamic locomotion skills with a high-level motion planner through a hierarchical reinforcement learning framework. The authors develop three skill-specific controllers (sidestep, dive, jump) parameterized by Bézier curves and a planning policy that selects the most suitable skill and trajectory based on ball and robot states, all validated with zero-shot sim-to-real transfer on a Mini Cheetah. Results show substantial improvements in interception performance over single-skill baselines, achieving up to 87.5% real-world saves and demonstrating the system's ability to handle a range of ball trajectories and speeds. The approach advances the feasibility of real-time, multi-skill, dynamic legged robotics in unstructured tasks like goalkeeping, with implications for broader autonomous sports robotics.
Abstract
We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world. Soccer goalkeeping using quadrupeds is a challenging problem, that combines highly dynamic locomotion with precise and fast non-prehensile object (ball) manipulation. The robot needs to react to and intercept a potentially flying ball using dynamic locomotion maneuvers in a very short amount of time, usually less than one second. In this paper, we propose to address this problem using a hierarchical model-free RL framework. The first component of the framework contains multiple control policies for distinct locomotion skills, which can be used to cover different regions of the goal. Each control policy enables the robot to track random parametric end-effector trajectories while performing one specific locomotion skill, such as jump, dive, and sidestep. These skills are then utilized by the second part of the framework which is a high-level planner to determine a desired skill and end-effector trajectory in order to intercept a ball flying to different regions of the goal. We deploy the proposed framework on a Mini Cheetah quadrupedal robot and demonstrate the effectiveness of our framework for various agile interceptions of a fast-moving ball in the real world.
