Time-optimal neural feedback control of nilpotent systems as a binary classification problem
Sara Bicego, Samuel Gue, Dante Kalise, Nelly Villamizar
TL;DR
This work addresses time-optimal control of linear nilpotent systems with scalar bang-bang inputs by two intertwined strategies: (i) solving the switching-time polynomial system via a deflated Newton method augmented with Hermite quadratic form-based root counting to bound the number of real roots, thereby enabling real-time computation of switching sequences; and (ii) learning a time-optimal feedback map as a binary classifier using synthetic data generated from optimal trajectories, with a confidence-aware augmentation that invokes the open-loop solver when needed. The authors demonstrate scalability up to 5th-order integrators, achieving high classification accuracy (up to 99.61%) and showing robustness to noise, while reducing reliance on computationally expensive algebraic tools such as Gröbner bases. The combination of algebraic root-finding bounds and data-driven feedback yields a practical framework for real-time time-optimal control in nilpotent systems, with clear pathways for extension to other time-optimal or sparse control problems. Overall, the paper provides a scalable pipeline that integrates deflation-based root search, Hermite-based root counting, and neural classifiers to synthesize robust, real-time Bang-bang feedback laws.
Abstract
A computational method for the synthesis of time-optimal feedback control laws for linear nilpotent systems is proposed. The method is based on the use of the bang-bang theorem, which leads to a characterization of the time-optimal trajectory as a parameter-dependent polynomial system for the control switching sequence. A deflated Newton's method is then applied to exhaust all the real roots of the polynomial system. The root-finding procedure is informed by the Hermite quadratic form, which provides a sharp estimate on the number of real roots to be found. In the second part of the paper, the polynomial systems are sampled and solved to generate a synthetic dataset for the construction of a time-optimal deep neural network -- interpreted as a binary classifier -- via supervised learning. Numerical tests in integrators of increasing dimension assess the accuracy, robustness, and real-time-control capabilities of the approximate control law.
