Revisiting Strong Duality, Hidden Convexity, and Gradient Dominance in the Linear Quadratic Regulator
Yuto Watanabe, Yang Zheng
TL;DR
The paper reexamines the Linear Quadratic Regulator through a primal–dual lens, revealing strong duality for the nonconvex policy problem under stabilizability and detectability, and characterizing when gradient dominance holds via Extended Convex Lifting (ECL) and Cauchy directions. It then shows that static linear policies are globally optimal among all stabilizing policies by leveraging Gramian/covariance representations and SDP-based inner/outer relaxations, with the optimal gain recoverable from the Riccati solution. Collectively, these results connect classical Riccati theory, SDP duality, and modern nonconvex optimization insights to provide a deeper, more unified understanding of LQR landscapes. The findings offer theoretical grounding for model-free and first-order algorithms in LQR and hint at broader applicability to other control problems where convex reformulations and lifting strategies can reveal hidden structure and convergence guarantees.
Abstract
The Linear Quadratic Regulator (LQR) is a cornerstone of optimal control theory, widely studied in both model-based and model-free approaches. Despite its well-established nature, certain foundational aspects remain subtle. In this paper, we revisit three key properties of policy optimization in LQR: (i) strong duality in the nonconvex policy optimization formulation, (ii) the gradient dominance property, examining when it holds and when it fails, and (iii) the global optimality of linear static policies. Using primal-dual analysis and convex reformulation, we refine and clarify existing results by leveraging Riccati equations/inequalities, semidefinite programming (SDP) duality, and a recent framework of Extended Convex Lifting (\texttt{ECL}). Our analysis confirms that LQR 1) behaves almost like a convex problem (e.g., strong duality) under the standard assumptions of stabilizability and detectability and 2) exhibits strong convexity-like properties (e.g., gradient dominance) under slightly stronger conditions. In particular, we establish a broader characterization under which gradient dominance holds using \texttt{ECL} and the notion of Cauchy directions. By clarifying and refining these theoretical insights, we hope this work contributes to a deeper understanding of LQR and may inspire further developments beyond LQR.
