On the Regret of Recursive Methods for Discrete-Time Adaptive Control with Matched Uncertainty
Aren Karapetyan, Efe C. Balta, Anastasios Tsiamis, Andrea Iannelli, John Lygeros
TL;DR
This work addresses finite-regret adaptive control for discrete-time nonlinear systems with matched uncertainty by formulating online parameter learning as minimization of a cost that aligns with uncertainty cancellation. It introduces Recursive Proximal Learning (RPL), a proximal-point-based update, and proves parameter-space contraction and closed-loop finite regret under a sufficient excitation condition, with improved constants under additional assumptions. It also analyzes Recursive Least Squares with Exponential Forgetting (RLSFF), showing finite regret under a stronger persistence of excitation condition and comparing the two methods in terms of regret performance. A discrete-time MRAC example demonstrates that RPL can achieve superior tracking and regret performance compared with RLSFF and a reference governor MRAC, highlighting the practical viability of causal, online, uncertainty-matching adaptive controllers. The results provide a principled path to achieving both stability and performance guarantees in adaptive control under time-varying dynamics, with clear guidance on excitation requirements and method selection for practitioners.
Abstract
Continuous-time adaptive controllers for systems with a matched uncertainty often comprise an online parameter estimator and a corresponding parameterized controller to cancel the uncertainty. However, such methods are often impossible to implement directly, as they depend on an unobserved estimation error. We consider the equivalent discrete-time setting with a causal information structure, and propose a novel, online proximal point method-based adaptive controller, that under a sufficient excitation (SE) condition is asymptotically stable and achieves finite regret, scaling only with the time required to fulfill the SE. We show the same also for the widely-used recursive least squares with exponential forgetting controller under a stronger persistence of excitation condition.
