Multi-party Agent Relation Sampling for Multi-party Ad Hoc Teamwork
Beiwen Zhang, Yongheng Liang, Hejun Wu
TL;DR
The paper addresses the challenge of ad hoc teamwork in multi party settings where controlled agents must coordinate with multiple unfamiliar groups. It introduces MAHT and the MARS framework that builds a sparse agent skeleton and uses a Relational Forward Model to capture cross group dynamics while learning cooperative embeddings to condition policies. Empirical results on MPE and StarCraft II show that MARS achieves stronger coordination and faster convergence than representative MARL and AHT baselines, with ablations confirming the importance of the RFM and the skeleton. The work advances scalable, cross group coordination in open multi agent environments with practical implications for real world ad hoc collaboration.
Abstract
Multi-agent reinforcement learning (MARl) has achieved strong results in cooperative tasks but typically assumes fixed, fully controlled teams. Ad hoc teamwork (AHT) relaxes this by allowing collaboration with unknown partners, yet existing variants still presume shared conventions. We introduce Multil-party Ad Hoc Teamwork (MAHT), where controlled agents must coordinate with multiple mutually unfamiliar groups of uncontrolled teammates. To address this, we propose MARs, which builds a sparse skeleton graph and applies relational modeling to capture cross-group dvnamics. Experiments on MPE and starCralt ll show that MARs outperforms MARL and AHT baselines while converging faster.
