Collective Behavior Clone with Visual Attention via Neural Interaction Graph Prediction
Kai Li, Zhao Ma, Liang Li, Shiyu Zhao
TL;DR
The paper addresses the challenge of learning both the local interaction mechanism and the collective control policy of a swarm from trajectory data. It introduces collective behavioral cloning (CBC), which uses a graph variational autoencoder (GVAE) to infer a time-varying interaction graph and behavioral cloning to learn the policy on top of that graph. A vision-based neighbor-selection module lets each robot choose its neighbors online, so the learned policy can run in a decentralized way. The key contributions are an enhanced GVAE that outperforms baselines in graph prediction and a real-world demonstration on a decentralized, vision-based robot swarm with no inter-robot communication, where CBC achieves lower action and trajectory errors than previous approaches. The results support CBC as a practical framework for understanding swarm dynamics and for building robust decentralized swarm robotics applications.
Abstract
In this paper, we propose a framework, collective behavioral cloning (CBC), to learn the underlying interaction mechanism and control policy of a swarm system. Given the trajectory data of a swarm system, we use a graph variational autoencoder (GVAE) to learn the local interaction graph. Based on the interaction graph and the swarm trajectory, we then use behavioral cloning to learn the control policy of the swarm system. To demonstrate the practicality of CBC, we deploy it on a real-world decentralized vision-based robot swarm system, where a visual attention network trained on the learned interaction graph performs online neighbor selection. Experimental results show that our method predicts both the interaction graph and the swarm actions more accurately than previous approaches. This work offers a promising approach for understanding interaction mechanisms and swarm dynamics in future swarm robotics research. Code and data are available.
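The second stage described above, behavioral cloning conditioned on an interaction graph, can be illustrated with a minimal sketch. This is not the authors' implementation: the graph is taken as given here (in CBC it would come from the GVAE), the policy is assumed to be linear in the mean relative position of an agent's neighbors, and it is fitted by ordinary least squares on synthetic data. The function names `neighbor_features` and `fit_linear_policy` are hypothetical.

```python
import numpy as np

def neighbor_features(positions, adjacency):
    """Mean relative position of each agent's neighbors.

    positions: (N, 2) array of agent positions.
    adjacency: (N, N) 0/1 array, adjacency[i, j] = 1 if agent j influences agent i.
    Agents with no neighbors get a zero feature vector.
    """
    rel = positions[None, :, :] - positions[:, None, :]  # rel[i, j] = p_j - p_i
    summed = (adjacency[:, :, None] * rel).sum(axis=1)
    degree = adjacency.sum(axis=1, keepdims=True)
    return summed / np.maximum(degree, 1)

def fit_linear_policy(features, actions):
    """Behavioral cloning as least squares: actions ~ features @ W (assumed linear policy)."""
    W, *_ = np.linalg.lstsq(features, actions, rcond=None)
    return W

# Synthetic demonstration data: the "expert" moves toward the mean neighbor
# position with an assumed gain of 0.5.
rng = np.random.default_rng(0)
true_gain = 0.5
feature_list, action_list = [], []
for _ in range(200):                       # 200 time steps
    pos = rng.normal(size=(5, 2))          # 5 agents in the plane
    adj = (rng.random((5, 5)) < 0.4).astype(float)
    np.fill_diagonal(adj, 0.0)             # no self-interaction
    f = neighbor_features(pos, adj)
    feature_list.append(f)
    action_list.append(true_gain * f)      # expert action

features = np.concatenate(feature_list)
actions = np.concatenate(action_list)
W = fit_linear_policy(features, actions)   # recovers roughly true_gain * identity
```

Because the interaction graph changes at every step, the features passed to the policy change with it, which is why an accurate graph estimate matters for the quality of the cloned policy.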
