Learning to Interact in World Latent for Team Coordination
Dongsu Lee, Daehee Lee, Yaru Niu, Honguk Woo, Amy Zhang, Ding Zhao
TL;DR
This paper addresses the challenge of coordinating multiple agents under partial observability by introducing Interactive World Latent (IWoL), a unified latent representation that encodes both inter-agent relations and task-relevant world information. IWoL learns this latent through a training-time graph-attention communication protocol and decoupled decoders (Interactive and World) that align the latent with coordination cues, while enabling two deployment modes: implicit (no messages at test time) and explicit (messages used by the policy). Across four robotics MARL benchmarks, IWoL variants consistently outperform strong baselines, demonstrate robustness to incomplete observations, and scale to large agent populations; ablations confirm the critical role of world and interactive decoders. The approach offers a simple, efficient drop-in solution for robust multi-agent coordination with potential for broad applicability and generalization in open-world MARL settings.
Abstract
This work presents a novel representation learning framework, interactive world latent (IWoL), to facilitate team coordination in multi-agent reinforcement learning (MARL). Building an effective representation for team coordination is challenging because of the intricate dynamics that emerge from multi-agent interaction and the incomplete information induced by local observations. Our key insight is to construct a learnable representation space that jointly captures inter-agent relations and task-specific world information by directly modeling communication protocols. With this representation, we maintain fully decentralized execution with implicit coordination while avoiding the inherent drawbacks of explicit message passing, e.g., slower decision-making, vulnerability to malicious attackers, and sensitivity to bandwidth constraints. In practice, our representation can be used not only as an implicit latent for each agent, but also as an explicit message for communication. Across four challenging MARL benchmarks, we evaluate both variants and show that IWoL provides a simple yet powerful key for team coordination. Moreover, we demonstrate that our representation can be combined with existing MARL algorithms to further enhance their performance.
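To make the distinction between the implicit and explicit variants concrete, the following is a minimal NumPy sketch of one possible reading, not the authors' implementation. It assumes a single-head attention layer over a hypothetical agent graph, random untrained weights, and toy dimensions; the names `attention_messages`, `w_enc`, `implicit_input`, and `explicit_input` are illustrative and do not come from the paper. The decoders and the training objective are omitted entirely.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention_messages(h, adjacency, w_q, w_k, w_v):
    """Single-head attention over graph neighbours (assumed form).

    h: (n_agents, d) per-agent encodings.
    adjacency: (n_agents, n_agents), nonzero where agent i may attend to j.
    Returns one aggregated message per agent, shape (n_agents, d).
    """
    q, k, v = h @ w_q, h @ w_k, h @ w_v
    scores = (q @ k.T) / np.sqrt(k.shape[-1])
    scores = np.where(adjacency > 0, scores, -1e9)  # mask non-neighbours
    return softmax(scores, axis=-1) @ v

# Toy setup: all sizes and weights are assumptions for illustration.
rng = np.random.default_rng(0)
n_agents, obs_dim, d = 3, 4, 8
obs = rng.normal(size=(n_agents, obs_dim))
w_enc = rng.normal(size=(obs_dim, d))
w_q, w_k, w_v = (rng.normal(size=(d, d)) for _ in range(3))
adjacency = np.ones((n_agents, n_agents))  # fully connected, self included

# Each agent encodes only its own local observation.
h = np.tanh(obs @ w_enc)

# Messages exchanged over the graph (in the paper, used during training).
msg = attention_messages(h, adjacency, w_q, w_k, w_v)

# Implicit variant (assumed): the policy sees only the local latent.
implicit_input = h
# Explicit variant (assumed): the policy also sees the received message.
explicit_input = np.concatenate([h, msg], axis=-1)

print(implicit_input.shape, explicit_input.shape)
```

In this sketch the implicit variant needs no communication at execution time because each row of `implicit_input` depends only on that agent's own observation, while the explicit variant doubles the policy input width by appending the aggregated message.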
