Accelerated Optimization Landscape of Linear-Quadratic Regulator

Lechen Feng; Yuan-Hua Ni

Accelerated Optimization Landscape of Linear-Quadratic Regulator

Lechen Feng, Yuan-Hua Ni

TL;DR

The paper addresses accelerating first-order optimization for linear-quadratic regulators (LQR), handling both state-feedback (SLQR) and output-feedback (OLQR) forms. It proves a Lipschitz Hessian property for the LQR cost and develops an accelerated continuous-time framework with a restarting hybrid dynamics that achieves the Nesterov-optimal rate $1-\frac{1}{\sqrt{\kappa}}$ for SLQR, with a discretized scheme that preserves this rate. For OLQR, it introduces a Hessian-free framework combining semiconvex optimization and negative curvature exploitation, yielding an $\mathcal{O}(\epsilon^{-7/4}\log(1/\epsilon))$ time to an $\epsilon$-stationary point with a second-order guarantee. The approach leverages sublevel-set smoothness, restarting rules, and a penalty-based augmentation to manage nonconvexity, delivering both theoretical convergence rates and practical numerical demonstrations. Together, these results offer a unified, efficient first-order optimization toolbox for LQR policy design with provable acceleration and robustness to nonconvexity.

Abstract

Linear-quadratic regulator (LQR) is a landmark problem in the field of optimal control, which is the concern of this paper. Generally, LQR is classified into state-feedback LQR (SLQR) and output-feedback LQR (OLQR) based on whether the full state is obtained. It has been suggested in existing literature that both SLQR and OLQR could be viewed as \textit{constrained nonconvex matrix optimization} problems in which the only variable to be optimized is the feedback gain matrix. In this paper, we introduce a first-order accelerated optimization framework of handling the LQR problem, and give its convergence analysis for the cases of SLQR and OLQR, respectively. Specifically, a Lipschiz Hessian property of LQR performance criterion is presented, which turns out to be a crucial property for the application of modern optimization techniques. For the SLQR problem, a continuous-time hybrid dynamic system is introduced, whose solution trajectory is shown to converge exponentially to the optimal feedback gain with Nesterov-optimal order $1-\frac{1}{\sqrtκ}$ ($κ$ the condition number). Then, the symplectic Euler scheme is utilized to discretize the hybrid dynamic system, and a Nesterov-type method with a restarting rule is proposed that preserves the continuous-time convergence rate, i.e., the discretized algorithm admits the Nesterov-optimal convergence order. For the OLQR problem, a Hessian-free accelerated framework is proposed, which is a two-procedure method consisting of semiconvex function optimization and negative curvature exploitation. In a time $\mathcal{O}(ε^{-7/4}\log(1/ε))$, the method can find an $ε$-stationary point of the performance criterion; this entails that the method improves upon the $\mathcal{O}(ε^{-2})$ complexity of vanilla gradient descent. Moreover, our method provides the second-order guarantee of stationary point.

Accelerated Optimization Landscape of Linear-Quadratic Regulator

TL;DR

for SLQR, with a discretized scheme that preserves this rate. For OLQR, it introduces a Hessian-free framework combining semiconvex optimization and negative curvature exploitation, yielding an

time to an

-stationary point with a second-order guarantee. The approach leverages sublevel-set smoothness, restarting rules, and a penalty-based augmentation to manage nonconvexity, delivering both theoretical convergence rates and practical numerical demonstrations. Together, these results offer a unified, efficient first-order optimization toolbox for LQR policy design with provable acceleration and robustness to nonconvexity.

Abstract

(

the condition number). Then, the symplectic Euler scheme is utilized to discretize the hybrid dynamic system, and a Nesterov-type method with a restarting rule is proposed that preserves the continuous-time convergence rate, i.e., the discretized algorithm admits the Nesterov-optimal convergence order. For the OLQR problem, a Hessian-free accelerated framework is proposed, which is a two-procedure method consisting of semiconvex function optimization and negative curvature exploitation. In a time

, the method can find an

-stationary point of the performance criterion; this entails that the method improves upon the

complexity of vanilla gradient descent. Moreover, our method provides the second-order guarantee of stationary point.

Paper Structure (16 sections, 18 theorems, 149 equations, 6 figures)

This paper contains 16 sections, 18 theorems, 149 equations, 6 figures.

Introduction
Related Works
Problem Formulation
Properties of $f(K)$ and $\mathcal{S}$
Acceleration Optimization of SLQR Problem
Structure of SLQR Problem
Continuous-time Analysis
Discrete-time Analysis
Acceleration Optimization of OLQR Problem
Semiconvex Function Optimization
Negative Curvature Exploitation
An Acceleration Framework of OLQR Problem
Numerical Examples
Conclusion
Appendix
...and 1 more sections

Key Result

Lemma 1

Optimization problem (eq5) is equivalent to the following matrix constrained optimization problem:

Figures (6)

Figure 1: Structure of feasible set of SLQR problem
Figure 2: Structure of OLQR problem
Figure 3: The result for $n=3$ with $K_0=[5,100,15]$
Figure 4: The result for $n=3$ with $K_0=[1,2,2]$
Figure 5: The result for $n=10$
...and 1 more figures

Theorems & Definitions (49)

Lemma 1: ref2
Lemma 2: ref2
Definition 1
Theorem 1
proof
Theorem 2: ref2
Theorem 3: $M$-Lipschitz Hessian
proof
Remark 1
Definition 2: Hybrid Dynamic System ref9
...and 39 more

Accelerated Optimization Landscape of Linear-Quadratic Regulator

TL;DR

Abstract

Accelerated Optimization Landscape of Linear-Quadratic Regulator

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (49)