LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control
Saurav Agarwal, Ramya Muthukrishnan, Walker Gosrich, Vijay Kumar, Alejandro Ribeiro
TL;DR
The paper tackles decentralized coverage control for robot swarms operating in environments with an unknown, static importance density field Φ. It introduces LPAC, a Learnable Perception-Action-Communication loop that combines a CNN-based perception module, a Graph Neural Network-based communication module, and a shallow MLP action module, trained via imitation learning from a clairvoyant CVT planner. Results show LPAC outperforms both centralized and decentralized CVT baselines, generalizes to larger numbers of robots and features, transfers to larger environments without retraining, and remains robust to position noise, real-world data, and sim-to-real demonstrations. These findings support the viability of learnable PAC architectures for scalable, robust, and transferable decentralized navigation in robot swarms, with potential applicability to other multi-agent reasoning tasks.
Abstract
Coverage control is the problem of navigating a robot swarm to collaboratively monitor features or a phenomenon of interest not known a priori. The problem is challenging in decentralized settings with robots that have limited communication and sensing capabilities. We propose a learnable Perception-Action-Communication (LPAC) architecture for the problem, wherein a convolutional neural network (CNN) processes localized perception; a graph neural network (GNN) facilitates robot communications; finally, a shallow multi-layer perceptron (MLP) computes robot actions. The GNN enables collaboration in the robot swarm by computing what information to communicate with nearby robots and how to incorporate received information. Evaluations show that the LPAC models -- trained using imitation learning -- outperform standard decentralized and centralized coverage control algorithms. The learned policy generalizes to environments different from the training dataset, transfers to larger environments with more robots, and is robust to noisy position estimates. The results indicate the suitability of LPAC architectures for decentralized navigation in robot swarms to achieve collaborative behavior.
