Learning Multiagent Communication with Backpropagation
Sainbayar Sukhbaatar, Arthur Szlam, Rob Fergus
TL;DR
The paper introduces CommNet, a differentiable framework enabling continuous communication among co-operating agents under partial observability and dynamic team sizes. By learning both control and inter-agent messaging through backpropagation, the model achieves improved coordination across diverse tasks, including lever-pulling, traffic management, combat, and reasoning challenges like bAbI. Key contributions include a permutation-invariant broadcast-communication mechanism, several architectural extensions (local connectivity, skip connections, temporal recurrence), and empirical evidence that learned communication can be sparse yet meaningful. The results indicate practical benefits for multi-agent systems and offer insights into interpretable communication strategies, with future work aimed at heterogenous agents and scaling to larger teams.
Abstract
Many tasks in AI require the collaboration of multiple agents. Typically, the communication protocol between agents is manually specified and not altered during training. In this paper we explore a simple neural model, called CommNet, that uses continuous communication for fully cooperative tasks. The model consists of multiple agents and the communication between them is learned alongside their policy. We apply this model to a diverse set of tasks, demonstrating the ability of the agents to learn to communicate amongst themselves, yielding improved performance over non-communicative agents and baselines. In some cases, it is possible to interpret the language devised by the agents, revealing simple but effective strategies for solving the task at hand.
