Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning
Weizheng Wang, Le Mao, Ruiqi Wang, Byung-Cheol Min
TL;DR
The paper tackles multi-robot socially-aware navigation (MR-SAN) in pedestrian-rich environments with limited communication by formulating the problem as a Dec-POSMDP and introducing SAMARL, a MARL framework augmented with a hybrid spatial-temporal transformer to model human-robot and robot-robot interactions. It combines a ST-graph social interaction encoder with MAPPO training to enable decentralized execution of cooperative, socially compliant policies that respect realistic kinematics. The key contributions include a dynamics-aware Dec-POSMDP formulation, a transformer-based HRI/RRI representation network, and extensive simulation and real-world demonstrations showing superior performance over baselines and ablations. The work advances practical, scalable multi-robot coordination in human environments, with potential impact on collaborative delivery, exploration, and service robotics in public spaces.
Abstract
In public spaces shared with humans, ensuring multi-robot systems navigate without collisions while respecting social norms is challenging, particularly with limited communication. Although current robot social navigation techniques leverage advances in reinforcement learning and deep learning, they frequently overlook robot dynamics in simulations, leading to a simulation-to-reality gap. In this paper, we bridge this gap by presenting a new multi-robot social navigation environment crafted using Dec-POSMDP and multi-agent reinforcement learning. Furthermore, we introduce SAMARL: a novel benchmark for cooperative multi-robot social navigation. SAMARL employs a unique spatial-temporal transformer combined with multi-agent reinforcement learning. This approach effectively captures the complex interactions between robots and humans, thus promoting cooperative tendencies in multi-robot systems. Our extensive experiments reveal that SAMARL outperforms existing baseline and ablation models in our designed environment. Demo videos for this work can be found at: https://sites.google.com/view/samarl
