Closed-Loop Finite-Time Analysis of Suboptimal Online Control
Aren Karapetyan, Efe C. Balta, Andrea Iannelli, John Lygeros
TL;DR
This work tackles finite-time suboptimality in online control of discrete-time nonlinear time-varying systems by bounding the extra cost incurred by suboptimal policies relative to a benchmark policy. It introduces and leverages exponential incremental stability ($*ediss$) to derive tight finite-time suboptimality bounds that depend on the suboptimal trajectory pathlength and the benchmark’s convergence rate. The authors prove that exponential stability of the benchmark (even in non-smooth settings) implies $*ediss$ under mild local conditions, enabling accurate transient guarantees for online controllers like suboptimal MPC. A detailed model-predictive-control use case with a suboptimal projected gradient method demonstrates the bounds and highlights a trade-off between computation and suboptimality. The framework provides actionable insights for allocating computational resources in real-time control and has potential applications in adaptive and online control where suboptimality arises from parameter uncertainty or changing costs.
Abstract
Suboptimal methods in optimal control arise due to a limited computational budget, unknown system dynamics, or a short prediction window among other reasons. Although these methods are ubiquitous, their transient performance remains relatively unstudied. We consider the control of discrete-time, nonlinear time-varying dynamical systems and establish sufficient conditions to analyze the finite-time closed-loop performance of such methods in terms of the additional cost incurred due to suboptimality. Finite-time guarantees allow the control design to distribute a limited computational budget over a time horizon and estimate the on-the-go loss in performance due to suboptimality. We study exponential incremental input-to-state stabilizing policies and show that for nonlinear systems, under some mild conditions, this property is directly implied by exponential stability without further assumptions on global smoothness. The analysis is showcased on a suboptimal model predictive control use case.
