Multi-Agent Synchronization Tasks
Rolando Fernandez, Garrett Warnell, Derrik E. Asher, Peter Stone
TL;DR
This work defines Multi-Agent Synchronization Tasks (MSTs) as a Dec-POMDP subclass where each agent's action space is partitioned into synchronization and neutral actions, introducing high-stakes coordination via sets $\mathbb{A}^+(s)$ and $\mathbb{A}^-(s)$. It presents Synchronized Predator-Prey as a concrete MST benchmark and evaluates state-of-the-art coordination-graph methods (DCG, DICG, QGNN) across varying team sizes and action heterogeneity. Results indicate that DCG offers the strongest performance among evaluated methods but cannot scale beyond 2-agent sub-teams or handle heterogeneous capture actions, while DICG and QGNN generally fail under these MST conditions; removing the miscapture penalty eliminates the MST constraint, enabling non-MST solutions. These findings challenge the applicability of current CG/GNN-based MARL approaches to complex coordination tasks and motivate further research into scalable, representation-rich message-passing frameworks for MST-like coordination. The paper contributes a formal MST framework and a tangible benchmark to guide future development of robust coordination strategies in multi-agent systems.
Abstract
In multi-agent reinforcement learning (MARL), coordination plays a crucial role in enhancing agents' performance beyond what they could achieve through cooperation alone. The interdependence of agents' actions, coupled with the need for communication, leads to a domain where effective coordination is crucial. In this paper, we introduce and define $\textit{Multi-Agent Synchronization Tasks}$ (MSTs), a novel subset of multi-agent tasks. We describe one MST, that we call $\textit{Synchronized Predator-Prey}$, offering a detailed description that will serve as the basis for evaluating a selection of recent state-of-the-art (SOTA) MARL algorithms explicitly designed to address coordination challenges through the use of communication strategies. Furthermore, we present empirical evidence that reveals the limitations of the algorithms assessed to solve MSTs, demonstrating their inability to scale effectively beyond 2-agent coordination tasks in scenarios where communication is a requisite component. Finally, the results raise questions about the applicability of recent SOTA approaches for complex coordination tasks (i.e. MSTs) and prompt further exploration into the underlying causes of their limitations in this context.
