Resilient Topology-Aware Coordination for Dynamic 3D UAV Networks under Node Failure
Chuan-Chi Lai
TL;DR
The paper tackles resilience of dynamic 3D aerial-ground networks under sudden UAV node failures and proposes TAG-MAPPO, a topology-aware graph-based MARL framework with a TA-GAT critic and Random Observation Shuffling to enable autonomous topological reconfiguration. It demonstrates that topology-aware coordination reduces signaling overhead, enables rapid self-healing with over 90% restoration of pre-failure coverage within 15 time steps, and improves fairness in dense urban deployments. The approach achieves faster convergence and higher energy efficiency than MLP-based MAPPO and QMIX baselines across urban, suburban, and rural scenarios, highlighting the value of graph-based relational reasoning for volatile network topologies. The results indicate that incorporating topology intelligence is essential for robust, scalable 6G aerial networks and provides a foundation for adaptive deployments in highly dynamic environments.
Abstract
In 3D Aerial-Ground Integrated Networks (AGINs), ensuring continuous service coverage under unexpected hardware failures is critical for mission-critical applications. While Multi-Agent Reinforcement Learning (MARL) has shown promise in autonomous coordination, its resilience under sudden node failures remains a challenge due to dynamic topology deformation. This paper proposes a Topology-Aware Graph MAPPO (TAG-MAPPO) framework designed to enhance system survivability through autonomous 3D spatial reconfiguration. Our framework incorporates graph-based feature aggregation with a residual ego-state fusion mechanism to capture intricate inter-agent dependencies. This architecture enables the surviving swarm to rapidly adapt its topology compared to conventional Multi-Layer Perceptron (MLP) based approaches. Extensive simulations across heterogeneous environments, ranging from interference-limited Crowded Urban to sparse Rural areas, validate the proposed approach. The results demonstrate that TAG-MAPPO consistently outperforms baselines in both stability and efficiency; specifically, it reduces redundant handoffs by up to 50 percent while maintaining a lead in energy efficiency. Most notably, the framework exhibits exceptional self-healing capabilities following a catastrophic node failure. TAG-MAPPO restores over 90 percent of the pre-failure service coverage within 15 time steps, exhibiting a significantly faster V-shaped recovery trajectory than MLP baselines. Furthermore, in dense urban scenarios, the framework achieves a post-failure Jain's Fairness Index that even surpasses its original four-UAV configuration by effectively resolving service overlaps. These findings suggest that topology-aware coordination is essential for the realization of resilient 6G aerial networks and provides a robust foundation for adaptive deployments in volatile environments.
