Towards Robust Multi-UAV Collaboration: MARL with Noise-Resilient Communication and Attention Mechanisms
Zilin Zhao, Chishui Chen, Haotian Shi, Jiale Chen, Xuanlin Yue, Zhejian Yang, Yang Liu
TL;DR
The paper tackles robust, information-rich path planning for multiple UAVs in full 3D under noisy communications. It introduces a MARL framework based on Counterfactual Multi-Agent Policy Gradients (COMA), augmented with the SenDFuse Network for sensor denoising and fusion and an attention-enhanced CBAM-based Actor-Critic. Key contributions include extending NestFusion to multi-UAV fusion, offline pretraining of SenDFuse, and centralized training with a counterfactual baseline for credit assignment, achieving improved information gain in 3D IPP tasks. Experiments on synthetic and real-world data demonstrate superior path planning efficiency and noise robustness, including up to a 78% reduction in entropy, highlighting practical impact for scalable, robust UAV information collection.
Abstract
Efficient path planning for unmanned aerial vehicles (UAVs) is crucial in remote sensing and information collection. As task scales expand, the cooperative deployment of multiple UAVs significantly improves information collection efficiency. However, collaborative communication and decision-making for multiple UAVs remain major challenges in path planning, especially in noisy environments. To efficiently accomplish complex information collection tasks in 3D space and address robust communication issues, we propose a multi-agent reinforcement learning (MARL) framework for UAV path planning based on the Counterfactual Multi-Agent Policy Gradients (COMA) algorithm. The framework incorporates attention mechanism-based UAV communication protocol and training-deployment system, significantly improving communication robustness and individual decision-making capabilities in noisy conditions. Experiments conducted on both synthetic and real-world datasets demonstrate that our method outperforms existing algorithms in terms of path planning efficiency and robustness, especially in noisy environments, achieving a 78\% improvement in entropy reduction.
