Data-driven Spatial Classification using Multi-Arm Bandits for Monitoring with Energy-Constrained Mobile Robots
Xiaoshan Lin, Siddharth Nayak, Stefano Di Cairano, Abraham P. Vinod
TL;DR
This work addresses rapid spatial classification for monitoring with a heterogeneous, energy-constrained team of mobile sensors and charging stations. It introduces a bi-level framework that uses a bandit-based high-level planner to select epoch goals from the candidate set $\mathscr{C}$ and an IP-based low-level planner to coordinate collision-free trajectories under energy constraints, with collision avoidance refined by a linear assignment step. The approach provides anytime guarantees and finite-time bounds, and its effectiveness is demonstrated through hardware experiments with drones and ground robots as well as extensive simulations in noisy settings. The results indicate strong practical viability for applications such as search-and-rescue and environmental monitoring, offering scalable, data-driven planning under uncertainty.
Abstract
We consider the spatial classification problem for monitoring using data collected by a coordinated team of mobile robots. Such classification problems arise in several applications including search-and-rescue and precision agriculture. Specifically, we want to classify the regions of a search environment into interesting and uninteresting as quickly as possible using a team of mobile sensors and mobile charging stations. We develop a data-driven strategy that accommodates the noise in sensed data and the limited energy capacity of the sensors, and generates collision-free motion plans for the team. We propose a bi-level approach, where a high-level planner leverages a multi-armed bandit framework to determine the potential regions of interest for the drones to visit next based on the data collected online. Then, a low-level path planner based on integer programming coordinates the paths for the team to visit the target regions subject to the physical constraints. We characterize several theoretical properties of the proposed approach, including anytime guarantees and task completion time. We show the efficacy of our approach in simulation, and further validate these observations in physical experiments using mobile robots.
