RobocupGym: A challenging continuous control benchmark in Robocup
Michael Beukman, Branden Ingram, Geraud Nangue Tasse, Benjamin Rosman, Pravesh Ranchod
TL;DR
This work introduces RobocupGym, an open-source library that links the Simspark-based Robocup 3D simulator with Stable Baselines 3 to create continuous-control RL benchmarks in robotic soccer. It provides premade tasks (SimpleKick and VelocityKick), a modular architecture for adding new tasks, and a Gymnasium-compatible interface to enable standard RL workflows and parallel training. Initial results show PPO and SAC can learn kicking behaviors, with PPO often outperforming SAC and parallelism reducing training time. By delivering a realistic, extendable robotics benchmark, RobocupGym enables practical RL research in high-dimensional, real-world-like control and paves the way for more sophisticated multi-task and multi-agent RL in robotic football.
Abstract
Reinforcement learning (RL) has progressed substantially over the past decade, with much of this progress being driven by benchmarks. Many benchmarks are focused on video or board games, and a large number of robotics benchmarks lack diversity and real-world applicability. In this paper, we aim to simplify the process of applying reinforcement learning in the 3D simulation league of Robocup, a robotic football competition. To this end, we introduce a Robocup-based RL environment based on the open source rcssserver3d soccer server, simple pre-defined tasks, and integration with a popular RL library, Stable Baselines 3. Our environment enables the creation of high-dimensional continuous control tasks within a robotics football simulation. In each task, an RL agent controls a simulated Nao robot, and can interact with the ball or other agents. We open-source our environment and training code at https://github.com/Michael-Beukman/RobocupGym.
