Multi-UAV Speed Control with Collision Avoidance and Handover-aware Cell Association: DRL with Action Branching
Zijiang Yan, Wael Jaafar, Bassant Selim, Hina Tabassum
TL;DR
This work tackles the challenge of jointly optimizing multi-UAV speed control and cell association on a 3D aerial highway to improve traffic flow, connectivity, and handover handling. It introduces a branching deep Q-network framework (BDQ/BDDQN) that decomposes the high-dimensional action space into transportation and telecommunication branches, leveraging a shared representation to coordinate decisions. The approach yields improved transportation and communication performance, reduces handover rates (with BDDQN achieving sub-1% HO rates after extensive training at moderate speeds), and demonstrates an effective trade-off when varying the number of available BSs. The study advances practical deployment of cooperative multi-UAV networks by incorporating collision avoidance, lane-changing dynamics, and HO-aware data-rate optimization.
Abstract
This paper presents a deep reinforcement learning solution for optimizing multi-UAV cell-association decisions and their moving velocity on a 3D aerial highway. The objective is to enhance transportation and communication performance, including collision avoidance, connectivity, and handovers. The problem is formulated as a Markov decision process (MDP) with UAVs' states defined by velocities and communication data rates. We propose a neural architecture with a shared decision module and multiple network branches, each dedicated to a specific action dimension in a 2D transportation-communication space. This design efficiently handles the multi-dimensional action space, allowing independence for individual action dimensions. We introduce two models, Branching Dueling Q-Network (BDQ) and Branching Dueling Double Deep Q-Network (Dueling DDQN), to demonstrate the approach. Simulation results show a significant improvement of 18.32% compared to existing benchmarks.
