Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

Marcell Bartos; Johannes Köhler; Florian Dörfler; Melanie N. Zeilinger

Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

Marcell Bartos, Johannes Köhler, Florian Dörfler, Melanie N. Zeilinger

TL;DR

The paper tackles online stabilization of discrete-time linear time-varying systems with unknown time-varying parameters and bounded disturbances. It introduces a modular pipeline that combines a projected least-mean-square (LMS) model learner with a certainty-equivalent linear-quadratic regulator (LQR) controller, enabling non-episodic updates at every time step. A time-varying Lyapunov analysis shows finite-gain $\ell^2$-stability of the closed-loop interconnection under a small parameter-diameter and step size, even without persistent excitation, and simulations on a nonlinear planar quadrotor demonstrate practical stability and potential parameter learning. The method is computationally light, robust to parameter drift, and suitable for real-time online adaptation, with future work aimed at proving parameter convergence under excitation and extending to broader system classes.

Abstract

Standard model-based control design deteriorates when the system dynamics change during operation. To overcome this challenge, online and adaptive methods have been proposed in the literature. In this work, we consider the class of discrete-time linear systems with unknown time-varying parameters. We propose a simple, modular, and computationally tractable approach by combining two classical and well-known building blocks from estimation and control: the least mean square filter and the certainty-equivalent linear quadratic regulator. Despite both building blocks being simple and off-the-shelf, our analysis shows that they can be seamlessly combined to a powerful pipeline with stability guarantees. Namely, finite-gain $\ell^2$-stability of the closed-loop interconnection of the unknown system, the parameter estimator, and the controller is proven, despite the presence of unknown disturbances and time-varying parametric uncertainties. Real-world applicability of the proposed algorithm is showcased by simulations carried out on a nonlinear planar quadrotor.

Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

TL;DR

Abstract

Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (12)