On Persistently Resetting Learning Integrators: A Framework For Model-Free Feedback Optimization
Mahmoud Abdelgalil, Jorge I. Poveda
TL;DR
The paper introduces persistently resetting learning integrators (PRLI) as a novel model-free, zeroth-order gradient oracle for real-time feedback optimization in dynamical systems. By formulating the optimization-and-plant interconnection as a hybrid dynamical system (HDS) and leveraging unbiased persistently exciting signals, PRLIs deliver global, real-time gradient estimates with controllably small perturbations, enabling robust coupling with discrete-time optimization algorithms. The authors establish uniform global asymptotic stability (UGAS) for static cost functions and extend the results to dynamic costs via a singular-perturbation timescale separation, proving semi-global practical stability (SGPAS) as the plant fast dynamics decouple. Numerical simulations validate the framework, demonstrating convergence of gradient descent, heavy-ball, and projected gradient descent under various PRLI-based configurations and showing effective interconnection with dynamic plants. Overall, the work bridges continuous-time physical dynamics with discrete-time optimization, offering a modular, globally stabilizing approach to model-free feedback optimization.
Abstract
We study a novel class of algorithms for solving model-free feedback optimization problems in dynamical systems. The key novelty is the introduction of \emph{persistent resetting learning integrators} (PRLI), which are integrators that are reset at the same frequency at which the plant is dithered using exploratory signals for model-free optimization. It is shown that PRLIs can serve as core mechanisms for real-time gradient estimation in online feedback-optimization tasks where only cost function measurements are available. In particular, unlike existing approaches based on approximation theory, such as averaging or finite-differences, PRLIs can produce global real-time gradient estimates of cost functions, with uniformly bounded perturbations of arbitrarily small magnitude. In this sense, PRLIs function as robust \emph{hybrid} "Oracles" suitable for interconnection with discrete-time optimization algorithms that optimize the performance of continuous-time dynamical plants in closed-loop operation. Compared to existing methods, PRLIs yield \emph{global} stability properties for a broad class of cost functions, surpassing the local or semi-global guarantees offered by traditional approaches based on perturbation and approximation theory. The proposed framework naturally bridges physical systems, modeled as continuous-time plants where continuous exploration is essential, with digital algorithms, represented as discrete-time optimization methods. The main results are illustrated using different numerical examples.
