The impact of behavioral diversity in multi-agent reinforcement learning
Matteo Bettini, Ryan Kortvelesy, Amanda Prorok
TL;DR
The paper investigates how behavioral diversity influences learning and performance in multi-agent reinforcement learning (MARL). It introduces System Neural Diversity (SND) to measure behavioral heterogeneity and Diversity Control (DiCo) to enforce a target diversity during training, enabling principled studies of heterogeneous teams. Across 2v2 and 5v5 soccer tasks, Pac-Men exploration, and Dynamic Passage resilience, constrained diversity yields emergent complementary roles (such as passing strategies and goalkeeping), improves coordination, accelerates exploration, and enhances resilience to disruptions. The findings suggest that diversity is a fundamental component of collective artificial learning, with tangible benefits over homogeneous training and potential implications for real-world multi-agent systems and lifelong learning.
Abstract
Many of the world's most pressing issues, such as climate change and global peace, require complex collective problem-solving skills. Recent studies indicate that diversity in individuals' behaviors is key to developing such skills and increasing collective performance. Yet behavioral diversity in collective artificial learning is understudied, with today's machine learning paradigms commonly favoring homogeneous agent strategies over heterogeneous ones, mainly due to computational considerations. In this work, we employ diversity measurement and control paradigms to study the impact of behavioral heterogeneity in several facets of multi-agent reinforcement learning. Through experiments in team play and other cooperative tasks, we show the emergence of unbiased behavioral roles that improve team outcomes; how behavioral diversity synergizes with morphological diversity; how diverse agents are more effective at finding cooperative solutions in sparse reward settings; and how behaviorally heterogeneous teams learn and retain latent skills to overcome repeated disruptions. Overall, our results indicate that, by controlling diversity, we can obtain non-trivial benefits over homogeneous training paradigms, demonstrating that diversity is a fundamental component of collective artificial learning, an insight thus far overlooked.
