Robust Online Convex Optimization for Disturbance Rejection

Joyce Lai; Peter Seiler

Robust Online Convex Optimization for Disturbance Rejection

Joyce Lai, Peter Seiler

TL;DR

Robust online convex optimization for disturbance rejection addresses learning-based disturbance rejection in discrete-time LTI plants under model uncertainty. The authors derive a robust stability condition via a scaled small gain theorem and embed it as an online constraint, enabling a constrained OCO (C-OCO) controller that bounds the learning dynamics in the ℓ∞-norm. They formulate an LFT-based analysis, relate the LTV learning dynamics to a time-varying gain M_t, and demonstrate through simulations that enforcing the stability bound β preserves closed-loop stability when the plant is imperfect. This framework enables stable, real-time disturbance learning for high-precision control applications facing unmodeled dynamics, balancing adaptability with provable robustness.

Abstract

Online convex optimization (OCO) is a powerful tool for learning sequential data, making it ideal for high precision control applications where the disturbances are arbitrary and unknown in advance. However, the ability of OCO-based controllers to accurately learn the disturbance while maintaining closed-loop stability relies on having an accurate model of the plant. This paper studies the performance of OCO-based controllers for linear time-invariant (LTI) systems subject to disturbance and model uncertainty. The model uncertainty can cause the closed-loop to become unstable. We provide a sufficient condition for robust stability based on the small gain theorem. This condition is easily incorporated as an on-line constraint in the OCO controller. Finally, we verify via numerical simulations that imposing the robust stability condition on the OCO controller ensures closed-loop stability.

Robust Online Convex Optimization for Disturbance Rejection

TL;DR

Abstract

Paper Structure (16 sections, 3 theorems, 39 equations, 6 figures)

This paper contains 16 sections, 3 theorems, 39 equations, 6 figures.

Introduction
Problem Formulation
Notation
Model Uncertainty
OCO Control
Model Uncertainty Effects on OCO Control
Main Result
Linear Fractional Transformation
Scaled Small Gain Theorem
Bounding the LTV Dynamics
Application to OCO
Estimator Design
OPGD on an Ideal Cost
Robust OCO Control
Numerical Results
...and 1 more sections

Key Result

Theorem 1

Consider the interconnection $F_U(P,\Gamma)$ where $P:\ell_{pe} \to \ell_{pe}$ and $\Gamma:\ell_{pe} \to \ell_{pe}$ are linear systems with finite induced $\ell_p$-norm. Partition $P$ as: where $\bar{p}:= \left[ \right]$ and $\bar{q}:= \left[ \right]$ are the inputs and outputs of $\Gamma$. The interconnection has finite induced $\ell_p$-norm, i.e. $\|F_U(P,\Gamma)\| < \infty$, if $\|P_{11} \|\

Figures (6)

Figure 1: Discrete-time feedback system with unknown disturbance $d$ and uncertainty $\Delta(z)$. OCO control is used to reject the disturbance $d$ without knowledge of the uncertainty $\Delta(z)$.
Figure 2: Block diagram representation of the OCO controller in a discrete-time feedback system with unknown disturbance $d_t$ and uncertain plant $\tilde{G}(z)$. The OCO controller is composed of a state-feedback gain $K$, an estimator $E(z)$, and an LTV system $M_{LTV}$.
Figure 3: Equivalent LFT $F_U(P,\Gamma)$ of original system separating LTI dynamics $P$ from uncertainty $\Delta$ and time-varying learning dynamics $M_{LTV}$.
Figure 4: Per-step cost (top) and disturbance estimate (bottom) of running U-OCO on a perfect (red dashed) and imperfect (blue solid) plant model. U-OCO is stable with a perfect model and unstable for an imperfect model.
Figure 5: Per-step cost (top) and disturbance estimate (bottom) of running C-OCO at $\beta=1.5$ on a perfect (red dashed) and imperfect (blue solid) plant model. C-OCO is stable for the perfect and imperfect models.
...and 1 more figures

Theorems & Definitions (6)

Theorem 1
proof
Theorem 2
proof
Theorem 3
proof

Robust Online Convex Optimization for Disturbance Rejection

TL;DR

Abstract

Robust Online Convex Optimization for Disturbance Rejection

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (6)