Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

Leilei Cui; Zhong-Ping Jiang; Eduardo D. Sontag

Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

Leilei Cui, Zhong-Ping Jiang, Eduardo D. Sontag

TL;DR

The paper addresses robustness of gradient-flow optimization under perturbations by establishing small-disturbance ISS for perturbed gradient flows under coercivity and a CJS-PL gradient-dominance condition. It develops a Lyapunov characterization of SSD, proves sufficiency (and discussion of necessity) for the ISS property, and applies the theory to LQR policy optimization, showing SSD for standard, natural, and Newton gradient flows. The LQR analysis demonstrates that the loss J2(K) is coercive with a gradient-dominance bound, enabling explicit ISS-Lyapunov constructions and robust convergence bounds under gradient estimation or numerical perturbations. This work provides a principled framework to quantify robustness of data-driven control and RL methods that rely on gradient-based policy updates, with concrete guarantees on convergence to neighborhoods of the optimum under bounded disturbances.

Abstract

This paper studies the effect of perturbations on the gradient flow of a general nonlinear programming problem, where the perturbation may arise from inaccurate gradient estimation in the setting of data-driven optimization. Under suitable conditions on the objective function, the perturbed gradient flow is shown to be small-disturbance input-to-state stable (ISS), which implies that, in the presence of a small-enough perturbation, the trajectories of the perturbed gradient flow must eventually enter a small neighborhood of the optimum. This work was motivated by the question of robustness of direct methods for the linear quadratic regulator problem, and specifically the analysis of the effect of perturbations caused by gradient estimation or round-off errors in policy optimization. We show small-disturbance ISS for three of the most common optimization algorithms: standard gradient flow, natural gradient flow, and Newton gradient flow.

Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

TL;DR

Abstract

Paper Structure (10 sections, 21 theorems, 121 equations)

This paper contains 10 sections, 21 theorems, 121 equations.

Introduction
Notations and Preliminaries
Small-Disturbance Input-to-State Stability
Robustness Analysis of Perturbed Gradient Flows
Application to LQR Problem
Preliminaries of LQR
Perturbed Standard Gradient Flow
Perturbed Natural Gradient Flow
Perturbed Newton Gradient Flow
Conclusions

Key Result

Lemma 2.1

For any $X,Y,Z \in \mathbb{R}^{n \times n}$, $\mathrm{Tr}\left(XYZ\right) = \mathrm{Tr}\left(ZXY\right) = \mathrm{Tr}\left(YZX\right)$.

Theorems & Definitions (46)

Definition 2.1: Definitions 2.5 and 24.2 in book_Hahn
Lemma 2.1: The cyclic property of the trace, equation (16) in book_Petersen
Lemma 2.2: Trace inequality Wang1986
Lemma 2.3: Cauchy-Schwarz inequality
proof
Lemma 2.4
proof
Lemma 2.5
proof
Lemma 2.6: Weak triangle inequality in Jiang1994
...and 36 more

Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

TL;DR

Abstract

Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (46)