Decentralized Collective World Model for Emergent Communication and Coordination
Kentaro Nomura, Tatsuya Aoki, Tadahiro Taniguchi, Takato Horii
TL;DR
This work tackles the challenge of enabling decentralized multi-agent systems to simultaneously learn shared symbol systems and coordinate under partial observability. It proposes a CPC-based, fully decentralized world model where two agents exchange bidirectional messages and optimize a collective free-energy objective that includes a collective regularization term, with CR approximated via a Product-of-Experts approach and message alignment via InfoNCE. The approach yields environment-general symbolic representations that reflect global environmental dynamics and improve coordination, especially when agents have heterogeneous perceptual capabilities. However, the coordination gap relative to a centralized or fully observable baseline highlights limitations in policy learning (demonstrated via imitation learning) and points to future work incorporating reinforcement learning, active inference, and scalability to larger agent teams. Overall, the results demonstrate that decentralized communication can foster meaningful symbol emergence while supporting coordination in dynamic, partially observable environments.
Abstract
We propose a fully decentralized multi-agent world model that enables both symbol emergence for communication and coordinated behavior through temporal extension of collective predictive coding. Unlike previous research that focuses on either communication or coordination separately, our approach achieves both simultaneously. Our method integrates world models with communication channels, enabling agents to predict environmental dynamics, estimate states from partial observations, and share critical information through bidirectional message exchange with contrastive learning for message alignment. Using a two-agent trajectory drawing task, we demonstrate that our communication-based approach outperforms non-communicative models when agents have divergent perceptual capabilities, achieving the second-best coordination after centralized models. Importantly, our decentralized approach with constraints preventing direct access to other agents' internal states facilitates the emergence of more meaningful symbol systems that accurately reflect environmental states. These findings demonstrate the effectiveness of decentralized communication for supporting coordination while developing shared representations of the environment.
