Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation
Francisco Giral, Ignacio Gómez, Ricardo Vinuesa, Soledad Le Clainche
TL;DR
The paper tackles fault-tolerant control for fixed-wing UAVs under large dynamical changes by replacing traditional inner-loop control with a transformer that maps outer-loop references $h_{ref}$, $heading_{ref}$, and $V_{T,ref}$ directly to control actions. It uses a teacher–student framework: a privileged DreamerV3-based agent trains on full observability, and a 0.8M-parameter Decision Transformer learns from offline expert trajectories under partial observability, leveraging in-context learning to adapt to failures without explicit fault detection. A Neural Lyapunov function $V_ heta(s)$ learned via Koopman eigenfunctions and integrated into the training objective promotes stability, improving robustness in extreme damage scenarios. Experiments in high-fidelity simulations show superior tracking and reduced crash rates compared with industry FCS and RL baselines, with the DT's lightweight footprint enabling feasible embedded deployment after quantization. The approach offers a practical path toward safe and adaptive UAV operations by unifying real-time adaptability, fault tolerance, and computational efficiency in a single transformer-based framework.
Abstract
This study presents a transformer-based approach for fault-tolerant control in fixed-wing Unmanned Aerial Vehicles (UAVs), designed to adapt in real time to dynamic changes caused by structural damage or actuator failures. Unlike traditional Flight Control Systems (FCSs) that rely on classical control theory and struggle under severe alterations in dynamics, our method directly maps outer-loop reference values -- altitude, heading, and airspeed -- into control commands using the in-context learning and attention mechanisms of transformers, thus bypassing inner-loop controllers and fault-detection layers. Employing a teacher-student knowledge distillation framework, the proposed approach trains a student agent with partial observations by transferring knowledge from a privileged expert agent with full observability, enabling robust performance across diverse failure scenarios. Experimental results demonstrate that our transformer-based controller outperforms industry-standard FCS and state-of-the-art reinforcement learning (RL) methods, maintaining high tracking accuracy and stability in nominal conditions and extreme failure cases, highlighting its potential for enhancing UAV operational safety and reliability.
