Robust and Efficient Communication in Multi-Agent Reinforcement Learning

Zejiao Liu; Yi Li; Jiali Wang; Junqi Tu; Yitian Hong; Fangfei Li; Yang Liu; Toshiharu Sugawara; Yang Tang

Robust and Efficient Communication in Multi-Agent Reinforcement Learning

Zejiao Liu, Yi Li, Jiali Wang, Junqi Tu, Yitian Hong, Fangfei Li, Yang Liu, Toshiharu Sugawara, Yang Tang

TL;DR

This survey addresses the gap between theoretical MARL models and real-world deployments by focusing on robust and bandwidth-efficient communication under non-ideal conditions, such as perturbations, delays, and bandwidth limits. It reviews foundational problem representations (Dec-POMDPs and Markov games), then surveys robustness against observation and message perturbations, along with delay-aware and bandwidth-aware learning frameworks. The authors highlight three key application domains—cooperative autonomous driving, distributed SLAM, and federated learning—to illustrate practical challenges and solutions, including adversarial defenses, information bottlenecks, and dynamic scheduling. The work advocates a unified, co-design approach across communication, learning, and robustness to bridge theory and practice, and outlines open challenges in security, delays, cross-layer optimization, and the use of large models for interpretable communication. The practical impact lies in guiding researchers toward designing MARL systems that perform reliably and efficiently in real-world, bandwidth-constrained, and adversarial environments.

Abstract

Multi-agent reinforcement learning (MARL) has made significant strides in enabling coordinated behaviors among autonomous agents. However, most existing approaches assume that communication is instantaneous, reliable, and has unlimited bandwidth; these conditions are rarely met in real-world deployments. This survey systematically reviews recent advances in robust and efficient communication strategies for MARL under realistic constraints, including message perturbations, transmission delays, and limited bandwidth. Furthermore, because the challenges of low-latency reliability, bandwidth-intensive data sharing, and communication-privacy trade-offs are central to practical MARL systems, we focus on three applications involving cooperative autonomous driving, distributed simultaneous localization and mapping, and federated learning. Finally, we identify key open challenges and future research directions, advocating a unified approach that co-designs communication, learning, and robustness to bridge the gap between theoretical MARL models and practical implementations.

Robust and Efficient Communication in Multi-Agent Reinforcement Learning

TL;DR

Abstract

Robust and Efficient Communication in Multi-Agent Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)