Trajectory Design for UAV-Based Low-Altitude Wireless Networks in Unknown Environments: A Digital Twin-Assisted TD3 Approach
Jihao Luo, Zesong Fei, Xinyi Wang, Le Zhao, Yuanhao Cui, Guangxu Zhu, Dusit Niyato
TL;DR
The paper tackles UAV trajectory design for low-altitude wireless networks in unknown environments by coupling a digital twin–assisted training framework with a SATD3TD trajectory design. It combines offline VE–driven learning in a DT with online VE updates to guide safe, efficient flight decisions, in a TDMA/ISAC setting. The main contributions are (i) the DTTDF that accelerates training and ensures safety, and (ii) the SATD3TD scheme that integrates simulated annealing for initial scheduling with TD3 for continuous trajectory optimization, yielding faster convergence, fewer collisions, and shorter mission times than baselines. The proposed approach enhances reliability and efficiency for LAWN deployments in uncertain settings, with potential to scale to larger UAV fleets and dynamic ground-user demands.
Abstract
Unmanned aerial vehicles (UAVs) are emerging as key enablers for low-altitude wireless network (LAWN), particularly when terrestrial networks are unavailable. In such scenarios, the environmental topology is typically unknown; hence, designing efficient and safe UAV trajectories is essential yet challenging. To address this, we propose a digital twin (DT)-assisted training and deployment framework. In this framework, the UAV transmits integrated sensing and communication signals to provide communication services to ground users, while simultaneously collecting echoes that are uploaded to the DT server to progressively construct virtual environments (VEs). These VEs accelerate model training and are continuously updated with real-time UAV sensing data during deployment, supporting decision-making and enhancing flight safety. Based on this framework, we further develop a trajectory design scheme that integrates simulated annealing for efficient user scheduling with the twin-delayed deep deterministic policy gradient algorithm for continuous trajectory design, aiming to minimize mission completion time while ensuring obstacle avoidance. Simulation results demonstrate that the proposed approach achieves faster convergence, higher flight safety, and shorter mission completion time compared with baseline methods, providing a robust and efficient solution for LAWN deployment in unknown environments.
