AC-MASAC: An Attentive Curriculum Learning Framework for Heterogeneous UAV Swarm Coordination
Wanhao Liu, Junhong Dai, Yixuan Zhang, Shengyun Yin, Panshuo Li
TL;DR
This work tackles cooperative path planning for heterogeneous UAV swarms under partial observability by framing the problem as a decentralized POMDP and introducing AC-MASAC. The method combines a heterogeneous attention-based actor-critic architecture with a structured curriculum and stage-aware replay to mitigate sparse rewards and catastrophic forgetting. Key contributions include a role-aware attention mechanism that models Leader–Follower dependencies, a curriculum learning strategy with hierarchical policy transfer and stage-proportional experience replay, and comprehensive experiments showing superior SR, FKR, and SMT against strong MARL baselines. The results demonstrate robust multi-agent coordination in dynamic environments and point to practical benefits for scalable UAV swarm control, with promising avenues for sim-to-real transfer.
Abstract
Cooperative path planning for heterogeneous UAV swarms poses significant challenges for Multi-Agent Reinforcement Learning (MARL), particularly in handling asymmetric inter-agent dependencies and addressing the risks of sparse rewards and catastrophic forgetting during training. To address these issues, this paper proposes an attentive curriculum learning framework (AC-MASAC). The framework introduces a role-aware heterogeneous attention mechanism to explicitly model asymmetric dependencies. Moreover, a structured curriculum strategy is designed, integrating hierarchical knowledge transfer and stage-proportional experience replay to address the issues of sparse rewards and catastrophic forgetting. The proposed framework is validated on a custom multi-agent simulation platform, and the results show that our method has significant advantages over other advanced methods in terms of Success Rate, Formation Keeping Rate, and Success-weighted Mission Time. The code is available at \textcolor{red}{https://github.com/Wanhao-Liu/AC-MASAC}.
