Shared Spatial Memory Through Predictive Coding
Zhengru Fang, Yu Guo, Jingjing Wang, Yuang Zhang, Haonan An, Yinhai Wang, Yuguang Fang
TL;DR
The work tackles how multiple agents build and maintain a shared spatial memory under partial observability and strict bandwidth constraints. It introduces a unified predictive coding framework with three levels: an internal grid-cell–like metric for self-localization, bandwidth-aware social communication via a variational information bottleneck, and a hierarchical policy (HRL-ICM) that actively explores to minimize joint uncertainty. Empirically, grid-cell–like representations spontaneously emerge from self-motion prediction, social place cells arise as partner-sensitive encoders, and memory-efficient communication supports robust cooperative navigation on Memory-Maze under severe bandwidth reductions (e.g., 73.5% success at 128 bits/step down to 64.4% at 4 bits/step, outperforming full-broadcast baselines). The results offer a principled, biologically plausible mechanism by which predictive loss drives the emergence of shared spatial memory and collective intelligence in multi-agent systems.
Abstract
Constructing a consistent shared spatial memory is a critical challenge in multi-agent systems, where partial observability and limited bandwidth often lead to catastrophic failures in coordination. We introduce a multi-agent predictive coding framework that formulates coordination as the minimization of mutual uncertainty among agents. Through an information bottleneck objective, this framework prompts agents to learn not only who and what to communicate but also when. At the foundation of this framework lies a grid-cell-like metric as internal spatial coding for self-localization, emerging spontaneously from self-supervised motion prediction. Building upon this internal spatial code, agents gradually develop a bandwidth-efficient communication mechanism and specialized neural populations that encode partners' locations-an artificial analogue of hippocampal social place cells (SPCs). These social representations are further utilized by a hierarchical reinforcement learning policy that actively explores to reduce joint uncertainty. On the Memory-Maze benchmark, our approach shows exceptional resilience to bandwidth constraints: success degrades gracefully from 73.5% to 64.4% as bandwidth shrinks from 128 to 4 bits/step, whereas a full-broadcast baseline collapses from 67.6% to 28.6%. Our findings establish a theoretically principled and biologically plausible basis for how complex social representations emerge from a unified predictive drive, leading to collective intelligence.
