Transformers for Trajectory Optimization with Application to Spacecraft Rendezvous
Tommaso Guffanti, Daniele Gammelli, Simone D'Amico, Marco Pavone
TL;DR
The paper tackles autonomous spacecraft rendezvous trajectory optimization under hard constraints by introducing the Autonomous Rendezvous Transformer (ART), a Transformer-based framework that both forecasts near-optimal trajectories and serves as a warm-start for sequential convex programs solving constrained non-convex optimal control problems. By casting trajectory generation as a sequence-prediction task with performance-to-go and constraint-to-go signals, ART leverages offline data to predict state/control sequences and optionally propagate dynamics in-loop to enforce feasibility. The work demonstrates that ART can outperform recurrent baselines in forecasting fuel-optimal trajectories and substantially accelerate SCP convergence, yielding near-optimal, feasible solutions with runtimes competitive with convex optimization. This approach meaningfully narrows the gap between data-driven and optimization-based methods, enabling onboard capable autonomy for RPOD tasks while preserving hard constraint satisfaction. The study also lays out design choices, including trajectory representation, MDP formulation, and inference strategies, and points to robustness and broader trajectory-optimization applications as future directions.
Abstract
Reliable and efficient trajectory optimization methods are a fundamental need for autonomous dynamical systems, effectively enabling applications including rocket landing, hypersonic reentry, spacecraft rendezvous, and docking. Within such safety-critical application areas, the complexity of the emerging trajectory optimization problems has motivated the application of AI-based techniques to enhance the performance of traditional approaches. However, current AI-based methods either attempt to fully replace traditional control algorithms, thus lacking constraint satisfaction guarantees and incurring in expensive simulation, or aim to solely imitate the behavior of traditional methods via supervised learning. To address these limitations, this paper proposes the Autonomous Rendezvous Transformer (ART) and assesses the capability of modern generative models to solve complex trajectory optimization problems, both from a forecasting and control standpoint. Specifically, this work assesses the capabilities of Transformers to (i) learn near-optimal policies from previously collected data, and (ii) warm-start a sequential optimizer for the solution of non-convex optimal control problems, thus guaranteeing hard constraint satisfaction. From a forecasting perspective, results highlight how ART outperforms other learning-based architectures at predicting known fuel-optimal trajectories. From a control perspective, empirical analyses show how policies learned through Transformers are able to generate near-optimal warm-starts, achieving trajectories that are (i) more fuel-efficient, (ii) obtained in fewer sequential optimizer iterations, and (iii) computed with an overall runtime comparable to benchmarks based on convex optimization.
