Reinforcement Learning Driven Multi-Robot Exploration via Explicit Communication and Density-Based Frontier Search
Gabriele Calzolari, Vidya Sumathy, Christoforos Kanellakis, George Nikolakopoulos
TL;DR
The paper tackles scalable, robust exploration by multiple heterogeneous robots under partial observability and limited communication. It introduces a decentralized CTDE RL framework using an agent-centered FOV occupancy grid, A$^*$-informed frontier features, and a constrained, proximity-based data-sharing mechanism, optimized with HAPPO and a shared critic. Two reward schemes encourage data sharing and frontier-driven exploration, with comprehensive simulations in Gymnasium and real-robot ROS2 trials demonstrating faster discovery and reduced redundancy. The approach substantially reduces exploration time via inter-agent map sharing and shows feasibility for real-world heterogeneous deployments, with avenues for extending to broader platforms.
Abstract
Collaborative multi-agent exploration of unknown environments is crucial for search and rescue operations. Effective real-world deployment must address challenges such as limited inter-agent communication and static and dynamic obstacles. This paper introduces a novel decentralized collaborative framework based on Reinforcement Learning to enhance multi-agent exploration in unknown environments. Our approach enables agents to decide their next action using an agent-centered field-of-view occupancy grid, and features extracted from $\text{A}^*$ algorithm-based trajectories to frontiers in the reconstructed global map. Furthermore, we propose a constrained communication scheme that enables agents to share their environmental knowledge efficiently, minimizing exploration redundancy. The decentralized nature of our framework ensures that each agent operates autonomously, while contributing to a collective exploration mission. Extensive simulations in Gymnasium and real-world experiments demonstrate the robustness and effectiveness of our system, while all the results highlight the benefits of combining autonomous exploration with inter-agent map sharing, advancing the development of scalable and resilient robotic exploration systems.
