A Framework for Adaptive Stabilisation of Nonlinear Stochastic Systems
Seth Siriya, Jingge Zhu, Dragan Nešić, Ye Pu
TL;DR
This work develops a modular certainty-equivalence adaptive-control framework for discrete-time nonlinear stochastic systems with linearly parameterised uncertainty, combining a parameterised stabilising policy with regularised least squares (RLS) parameter estimation. It derives non-asymptotic, high-probability guarantees: (i) under regional excitation and a robustly positive invariant (RPI) set, a time-varying estimation error bound $e(t,\delta,x_0)$ and almost-sure invariance of the state within the RPI region; (ii) under global excitation and a global stochastic Lyapunov function, high-probability stability bounds hold for any initial condition; and (iii) the results are illustrated on a piecewise-affine system and an input-constrained Gaussian-disturbed linear system. The analysis avoids requiring a priori bounded parameter sets and emphasizes informative data through excitation regions, enabling stability guarantees even when global stabilisability is unavailable. The framework thereby provides a practical, verifiable approach to learning-based adaptive stabilization for a broad class of stochastic nonlinear systems with linear-in-parameters structure, with clear conditions for regional and global stability guarantees. The results have potential impact on reliable autonomous control in uncertain, noisy environments where online learning and adaptation are essential.
Abstract
We consider the adaptive control problem for discrete-time, nonlinear stochastic systems with linearly parameterised uncertainty. Assuming access to a parameterised family of controllers that can stabilise the system in a bounded set within an informative region of the state space when the parameter is well-chosen, we propose a certainty equivalence learning-based adaptive control strategy, and subsequently derive stability bounds on the closed-loop system that hold for some probabilities. We then show that if the entire state space is informative, and the family of controllers is globally stabilising with appropriately chosen parameters, high probability stability guarantees can be derived.
