CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Kota Kondo, Andrea Tagliabue, Xiaoyi Cai, Claudius Tewari, Olivia Garcia, Marcos Espitia-Alvarez, Jonathan P. How
TL;DR
This work introduces Constraint-Guided Diffusion (CGD), an imitation-learning framework for UAV trajectory planning that combines diffusion-policy-based path generation with a surrogate optimization loop to enforce constraint satisfaction. By decomposing the original non-convex problem into a collision-avoidance subproblem refined by a diffusion model and a separate time-parametrization subproblem guided by constraint gradients, CGD achieves collision-free, dynamically feasible trajectories under deployment-time constraints that may differ from training. The approach leverages a diffusion model trained from an optimization-based expert (PANTHER*) and augments it with a Quadratic Program (QP) to enforce dynamics, a time-guide to adjust trajectory duration, a goal-conditioning mechanism, and an obstacle-collision guide, enabling robust performance in both in-distribution and out-of-distribution scenarios. Experimental results show CGD outperforms a Deep-PANTHER-like MLP baseline in terms of multimodal trajectory capture, constraint satisfaction, and computation time, with promising generalization to tighter constraints and unseen goals, suggesting practical viability for real-time UAV planning and potential extension to multiagent settings.
Abstract
Traditional optimization-based planners, while effective, suffer from high computational costs, resulting in slow trajectory generation. A successful strategy to reduce computation time involves using Imitation Learning (IL) to develop fast neural network (NN) policies from those planners, which are treated as expert demonstrators. Although the resulting NN policies are effective at quickly generating trajectories similar to those from the expert, (1) their output does not explicitly account for dynamic feasibility, and (2) the policies do not accommodate changes in the constraints different from those used during training. To overcome these limitations, we propose Constraint-Guided Diffusion (CGD), a novel IL-based approach to trajectory planning. CGD leverages a hybrid learning/online optimization scheme that combines diffusion policies with a surrogate efficient optimization problem, enabling the generation of collision-free, dynamically feasible trajectories. The key ideas of CGD include dividing the original challenging optimization problem solved by the expert into two more manageable sub-problems: (a) efficiently finding collision-free paths, and (b) determining a dynamically-feasible time-parametrization for those paths to obtain a trajectory. Compared to conventional neural network architectures, we demonstrate through numerical evaluations significant improvements in performance and dynamic feasibility under scenarios with new constraints never encountered during training.
