Investigating Relational State Abstraction in Collaborative MARL
Sharlin Utke, Jeremie Houssineau, Giovanni Montana
TL;DR
This work tackles sample efficiency in collaborative MARL by introducing MARC, a simple yet effective relational critic that encodes observations as a spatial graph and processes them with a relational graph neural network. By exploiting translation-invariant spatial relations and a shared observation encoder within a centralized training/decentralized execution framework, MARC improves both sample efficiency and asymptotic performance across six spatially demanding tasks, including heterogeneous agents and a continuous-domain task. Extensive ablations show the default spatial relation set provides strong inductive bias, while overly fine-grained or fully dense graphs can hinder efficiency; MARC also demonstrates robust generalization to varying agent and object configurations. The findings highlight relational state abstraction as a practical avenue for more efficient MARL in spatially complex environments, with potential extensions to richer inductive biases and interpretability for real-world deployment.
Abstract
This paper explores the impact of relational state abstraction on sample efficiency and performance in collaborative Multi-Agent Reinforcement Learning. The proposed abstraction is based on spatial relationships in environments where direct communication between agents is not allowed, leveraging the ubiquity of spatial reasoning in real-world multi-agent scenarios. We introduce MARC (Multi-Agent Relational Critic), a simple yet effective critic architecture incorporating spatial relational inductive biases by transforming the state into a spatial graph and processing it through a relational graph neural network. The performance of MARC is evaluated across six collaborative tasks, including a novel environment with heterogeneous agents. We conduct a comprehensive empirical analysis, comparing MARC against state-of-the-art MARL baselines, demonstrating improvements in both sample efficiency and asymptotic performance, as well as its potential for generalization. Our findings suggest that a minimal integration of spatial relational inductive biases as abstraction can yield substantial benefits without requiring complex designs or task-specific engineering. This work provides insights into the potential of relational state abstraction to address sample efficiency, a key challenge in MARL, offering a promising direction for developing more efficient algorithms in spatially complex environments.
