Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design
Jannik Graebner, Ryne Beeson
TL;DR
The paper addresses the challenge of identifying Pareto-optimal, low-thrust spacecraft trajectories in the CR3BP by recasting the search as sampling from a distribution over locally optimal costates. It integrates gradient-informed MCMC (MALA and HMC) with a diffusion-model self-supervised training loop, using analytic derivatives from state-transition matrices to compute target gradients efficiently. Empirical results on a Saturn-Titan transfer show that gradient-based samplers dramatically improve feasibility and Pareto-front coverage over prior RWM approaches, with MALA delivering the best trade-off between quality and computational cost. The final diffusion-model fine-tuning step enables scalable generation of high-quality samples and reveals the global structure of the solution space, promising faster, more scalable preliminary mission design.
Abstract
Preliminary mission design of low-thrust spacecraft trajectories in the Circular Restricted Three-Body Problem is a global search characterized by a complex objective landscape and numerous local minima. Formulating the problem as sampling from an unnormalized distribution supported on neighborhoods of locally optimal solutions, provides the opportunity to deploy Markov chain Monte Carlo methods and generative machine learning. In this work, we extend our previous self-supervised diffusion model fine-tuning framework to employ gradient-informed Markov chain Monte Carlo. We compare two algorithms - the Metropolis-Adjusted Langevin Algorithm and Hamiltonian Monte Carlo - both initialized from a distribution learned by a diffusion model. Derivatives of an objective function that balances fuel consumption, time of flight and constraint violations are computed analytically using state transition matrices. We show that incorporating the gradient drift term accelerates mixing and improves convergence of the Markov chain for a multi-revolution transfer in the Saturn-Titan system. Among the evaluated methods, MALA provides the best trade-off between performance and computational cost. Starting from samples generated by a baseline diffusion model trained on a related transfer, MALA explicitly targets Pareto-optimal solutions. Compared to a random walk Metropolis algorithm, it increases the feasibility rate from 17.34% to 63.01% and produces a denser, more diverse coverage of the Pareto front. By fine-tuning a diffusion model on the generated samples and associated reward values with reward-weighted likelihood maximization, we learn the global solution structure of the problem and eliminate the need for a tedious separate data generation phase.
