Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
Michael Kölle, Yannick Erpelding, Fabian Ritz, Thomy Phan, Steffen Illium, Claudia Linnhoff-Popien
TL;DR
Aquarium presents a unified, open-source MARL environment for predator–prey research, integrating the PettingZoo framework to support reproducible, scalable experiments in a continuous 2D toroidal world. It delivers fully configurable dynamics, perception, and rewards, along with visualization and video recording to facilitate both quantitative benchmarking and qualitative analysis. Using PPO with GAE, the authors compare Individual Learning and Parameter Sharing policies, showing that shared-parameter training improves coordination and sample efficiency relative to independently trained agents, while strong baselines like TurnAway can outperform learned strategies in certain metrics. The work aims to standardize predator–prey MARL experimentation, enabling systematic investigations of swarm behaviors, social dilemmas, and emergent coordination with practical implications for multi-agent system design and ecological modelling.
Abstract
Recent advances in Multi-Agent Reinforcement Learning have prompted the modeling of intricate interactions between agents in simulated environments. In particular, the predator-prey dynamics have captured substantial interest and various simulations been tailored to unique requirements. To prevent further time-intensive developments, we introduce Aquarium, a comprehensive Multi-Agent Reinforcement Learning environment for predator-prey interaction, enabling the study of emergent behavior. Aquarium is open source and offers a seamless integration of the PettingZoo framework, allowing a quick start with proven algorithm implementations. It features physics-based agent movement on a two-dimensional, edge-wrapping plane. The agent-environment interaction (observations, actions, rewards) and the environment settings (agent speed, prey reproduction, predator starvation, and others) are fully customizable. Besides a resource-efficient visualization, Aquarium supports to record video files, providing a visual comprehension of agent behavior. To demonstrate the environment's capabilities, we conduct preliminary studies which use PPO to train multiple prey agents to evade a predator. In accordance to the literature, we find Individual Learning to result in worse performance than Parameter Sharing, which significantly improves coordination and sample-efficiency.
