Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness
Yanwei Gong, Junchao Fan, Ruichen Zhang, Dusit Niyato, Yingying Yao, Xiaolin Chang
TL;DR
The paper tackles UAV trajectory planning in dense low-altitude urban airspace where safety, regulatory compliance, and energy efficiency are all critical. It introduces a hybrid framework that combines soft actor-critic DRL with large language model reasoning, formulated as a POMDP with a composite reward to encourage robust obstacle avoidance, compliance, and economical operation. The LLM provides semantic, rule-aware guidance during training when obstacles are detected, while the deployment phase relies on the trained SAC policy to ensure real-time performance; computational complexity accounts for occasional LLM inferences during training. Experimental results demonstrate that the approach improves data collection rate, reduces collisions and regulatory violations to near zero, achieves 100% successful landings, and lowers energy consumption relative to baselines. This work offers a practical pathway to scalable, compliant, and energy-efficient UAV data collection in the evolving low-altitude economy.
Abstract
The rapid growth of the low-altitude economy has driven the widespread adoption of unmanned aerial vehicles (UAVs). This growing deployment presents new challenges for UAV trajectory planning in complex urban environments. However, existing studies often overlook key factors, such as urban airspace constraints and economic efficiency, which are essential in low-altitude economy contexts. Deep reinforcement learning (DRL) is regarded as a promising solution to these issues, while its practical adoption remains limited by low learning efficiency. To overcome this limitation, we propose a novel UAV trajectory planning framework that combines DRL with large language model (LLM) reasoning to enable safe, compliant, and economically viable path planning. Experimental results demonstrate that our method significantly outperforms existing baselines across multiple metrics, including data collection rate, collision avoidance, successful landing, regulatory compliance, and energy efficiency. These results validate the effectiveness of our approach in addressing UAV trajectory planning key challenges under constraints of the low-altitude economy networking.
