Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning
Nikunj Gupta, James Zachary Hare, Rajgopal Kannan, Viktor Prasanna
TL;DR
This work introduces Deep Meta Coordination Graphs (DMCG) to address cooperative MARL by jointly modeling higher-order and indirect agent interactions through dynamic meta coordination graphs. DMCG constructs multiple interaction-type graphs, selectively combines them, and applies graph convolutions to produce expressive agent representations that feed into a DCG-style value factorization with per-agent and pairwise terms. The approach effectively mitigates miscoordination and relative overgeneralization, delivering strong performance and sample efficiency on MACO tasks and scalable results on SMACv2. Overall, DMCG demonstrates the importance of learning adaptive, multi-hop interaction structures for robust, scalable coordination in complex multi-agent environments.
Abstract
This paper presents deep meta coordination graphs (DMCG) for learning cooperative policies in multi-agent reinforcement learning (MARL). Coordination graph formulations encode local interactions and accordingly factorize the joint value function of all agents to improve efficiency in MARL. However, existing approaches rely solely on pairwise relations between agents, which potentially oversimplifies complex multi-agent interactions. DMCG goes beyond these simple direct interactions by also capturing useful higher-order and indirect relationships among agents. It generates novel graph structures accommodating multiple types of interactions and arbitrary lengths of multi-hop connections in coordination graphs to model such interactions. It then employs a graph convolutional network module to learn powerful representations in an end-to-end manner. We demonstrate its effectiveness in multiple coordination problems in MARL where other state-of-the-art methods can suffer from sample inefficiency or fail entirely. All codes can be found here: https://github.com/Nikunj-Gupta/dmcg-marl.
