Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
Xinran Li, Xiaolu Wang, Chenjia Bai, Jun Zhang
TL;DR
This work tackles scalable communication in cooperative multi-agent reinforcement learning under partial observability by designing ExpoComm, a protocol that leverages exponential graph topologies to achieve fast information diffusion with near-linear communication costs. It integrates memory-based message processors and auxiliary grounding tasks to ensure messages reflect global information and aid decision-making, addressing the inefficiencies of pairwise connectivity in large-scale systems. Across twelve large-scale scenarios in MAgent and Infrastructure Management Planning, ExpoComm demonstrates superior performance and robust zero-shot transfer to larger agent counts, with the one-peer variant often delivering the best trade-off between performance and communication budget. The approach offers practical impact for real-world, many-agent systems by enabling scalable, globally informed coordination without prohibitive communication overhead, and its open-source code facilitates adoption and further research. The key ideas are formalized around an $\mathcal{G}^t$ topology with diameter $\lceil \log_2(N-1)\rceil$ and edge count scaling near linearly with $N$, alongside grounding losses that align local messages with global information via $\mathcal{L}^{\text{Aux}}_{\text{pred}}$ or InfoNCE, depending on state availability.
Abstract
In cooperative multi-agent reinforcement learning (MARL), well-designed communication protocols can effectively facilitate consensus among agents, thereby enhancing task performance. Moreover, in large-scale multi-agent systems commonly found in real-world applications, effective communication plays an even more critical role due to the escalated challenge of partial observability compared to smaller-scale setups. In this work, we endeavor to develop a scalable communication protocol for MARL. Unlike previous methods that focus on selecting optimal pairwise communication links-a task that becomes increasingly complex as the number of agents grows-we adopt a global perspective on communication topology design. Specifically, we propose utilizing the exponential topology to enable rapid information dissemination among agents by leveraging its small-diameter and small-size properties. This approach leads to a scalable communication protocol, named ExpoComm. To fully unlock the potential of exponential graphs as communication topologies, we employ memory-based message processors and auxiliary tasks to ground messages, ensuring that they reflect global information and benefit decision-making. Extensive experiments on large-scale cooperative benchmarks, including MAgent and Infrastructure Management Planning, demonstrate the superior performance and robust zero-shot transferability of ExpoComm compared to existing communication strategies. The code is publicly available at https://github.com/LXXXXR/ExpoComm.
