Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement
Jiafan Zhuang, Zihao Xia, Gaofei Han, Boxi Wang, Wenji Li, Dongliang Wang, Zhifeng Hao, Ruichu Cai, Zhun Fan
TL;DR
The paper tackles the generalization gap in DRL-based multi-UAV collision avoidance caused by non-causal visual factors. It introduces causal representation disentanglement with a background intervention to separate task-relevant causal factors $S$ from non-causal factors $U$, feeding only $Z_2$ and $Z_3$ to the policy. The approach combines a VAE-based representation with losses $\\mathcal{L}_{vae}$, $\mathcal{L}_{rec}$, and $\mathcal{L}_C$, plus Fourier-domain background perturbations to create multi-domain data. Experiments in Unreal Engine/AirSim show robust generalization to unseen environments (grassland, snow mountain, forest) and initialization patterns, with clear improvements in Success Rate and SPL over the SAC+RAE baseline, demonstrating practical impact for reliable UAV operation in unknown scenarios.
Abstract
Deep reinforcement learning (DRL) has achieved remarkable progress in online path planning tasks for multi-UAV systems. However, existing DRL-based methods often suffer from performance degradation when tackling unseen scenarios, since the non-causal factors in visual representations adversely affect policy learning. To address this issue, we propose a novel representation learning approach, \ie, causal representation disentanglement, which can identify the causal and non-causal factors in representations. After that, we only pass causal factors for subsequent policy learning and thus explicitly eliminate the influence of non-causal factors, which effectively improves the generalization ability of DRL models. Experimental results show that our proposed method can achieve robust navigation performance and effective collision avoidance especially in unseen scenarios, which significantly outperforms existing SOTA algorithms.
