A Data-Driven Algorithm for Model-Free Control Synthesis
Sean Bowerfind, Matthew R. Kirchner, Gary Hewer
TL;DR
This work develops a data-driven framework for synthesizing the infinite-horizon LQR controller for continuous-time systems without explicit dynamics models. It exploits a necessary condition on the value function, discretizes it, and formulates an implicit NLP to recover the LQR gain $K$ (and a feedforward $F$ for tracking) from finite data, robust to noise via a PSD factorization $P=L^TL$. The method extends to reference tracking and to mixed-model scenarios where part of the dynamics is known, solving for $(K,F)$ or $(K,F)$ with the known dynamics in the loop. Demonstrations include a known linear-system benchmark and a real-world flight-test on a subscale UAV, showing model-free controllers closely match traditional LQR performance and effectively track references under realistic conditions. The approach offers a practical route to LQR-like control when modeling is difficult or impractical, with potential for real-time adaptive extensions and data-driven validation in aerospace and other nonlinear domains.
Abstract
Presented is an algorithm to synthesize the optimal infinite-horizon LQR feedback controller for continuous-time systems. The algorithm does not require knowledge of the system dynamics but instead uses only a finite-length sampling of arbitrary input-output data. The algorithm is based on a constrained optimization problem that enforces a necessary condition on the dynamics of the optimal value function along any trajectory. In addition to calculating the standard LQR gain matrix, a feedforward gain can be found to implement a reference tracking controller. This paper presents a theoretical justification for the method and shows several examples, including a validation test on a real scale aircraft.
