CQLite: Communication-Efficient Multi-Robot Exploration Using Coverage-biased Distributed Q-Learning
Ehsan Latif, Ramviyas Parasuraman
TL;DR
The paper tackles the scalability challenge in multi-robot exploration by introducing CQLite, a distributed Q-learning framework that minimizes inter-robot communication through selective sharing of updated $Q$-values and ad-hoc map merging. It presents a coverage-biased reward structure and a Voronoi-based task partitioning strategy to coordinate exploration with limited data exchange, accompanied by theoretical convergence guarantees and time-complexity analysis. Extensive simulations in ROS/Gazebo and real-world Turtlebot3 experiments show that CQLite achieves faster convergence, larger coverage, and dramatically reduced communication and computation compared to RRT and DRL baselines. The work demonstrates practical benefits for resource-constrained, cooperative robotics applications and provides open-source ROS tooling for broader adoption.
Abstract
Frontier exploration and reinforcement learning have historically been used to solve the problem of enabling many mobile robots to autonomously and cooperatively explore complex surroundings. These methods need to keep an internal global map for navigation, but they do not take into consideration the high costs of communication and information sharing between robots. This study offers CQLite, a novel distributed Q-learning technique designed to minimize data communication overhead between robots while achieving rapid convergence and thorough coverage in multi-robot exploration. The proposed CQLite method uses ad hoc map merging, and selectively shares updated Q-values at recently identified frontiers to significantly reduce communication costs. The theoretical analysis of CQLite's convergence and efficiency, together with extensive numerical verification on simulated indoor maps utilizing several robots, demonstrates the method's novelty. With over 2x reductions in computation and communication alongside improved mapping performance, CQLite outperformed cutting-edge multi-robot exploration techniques like Rapidly Exploring Random Trees and Deep Reinforcement Learning. Related codes are open-sourced at \url{https://github.com/herolab-uga/cqlite}.
