Learning to Imitate Spatial Organization in Multi-robot Systems
Ayomide O. Agunloye, Sarvapali D. Ramchurn, Mohammad D. Soorati
TL;DR
This work tackles reconstructing swarm collective behavior without access to swarm controllers by transforming expert demonstrations into informative multi-agent interaction features and recovering policies with Multi-Agent GAIL (MA-GAIL) using multiple discriminators and demonstration sharing. The method is evaluated on three spatial organization tasks (aggregation, homing, obstacle avoidance) by separating the environment into motion and control layers within a DEC-POMDP framework, achieving near-expert performance and demonstrating the value of environment-aware representations. Key findings include improved learning representation through feature transformation and robust imitation across diverse scenarios, while noting limitations in highly sparse-reward tasks and scalability with many discriminators. The work advances practical swarm analysis and testing by enabling behavior reconstruction and verification without direct access to the original swarm controllers.
Abstract
Understanding collective behavior and how it evolves is important to ensure that robot swarms can be trusted in a shared environment. One way to understand the behavior of the swarm is through collective behavior reconstruction using prior demonstrations. Existing approaches often require access to the swarm controller which may not be available. We reconstruct collective behaviors in distinct swarm scenarios involving shared environments without using swarm controller information. We achieve this by transforming prior demonstrations into features that describe multi-agent interactions before behavior reconstruction with multi-agent generative adversarial imitation learning (MA-GAIL). We show that our approach outperforms existing algorithms in spatial organization, and can be used to observe and reconstruct a swarm's behavior for further analysis and testing, which might be impractical or undesirable on the original robot swarm.
