Automated tight Lyapunov analysis for first-order methods
Manu Upadhyaya, Sebastian Banert, Adrien B. Taylor, Pontus Giselsson
TL;DR
This work automates Lyapunov-based convergence analysis for a broad class of fixed-parameter first-order methods solving convex problems by casting the search for a quadratic Lyapunov inequality as a small semidefinite program. It unifies an operator-splitting algorithm representation with interpolation conditions and derives a necessary-and-sufficient SDP feasibility test (Theorem) that certifies Lyapunov functions and residuals over the entire function class. The approach yields both linear and ergodic convergence guarantees and extends duality-gap convergence regions for challenging methods such as Chambolle–Pock when the linear operator is the identity, beyond classical parameter choices. The methodology, validated on Douglas–Rachford, heavy-ball gradient variants, Davis–Yin, and Chambolle–Pock, provides sharper rates and wider operational parameter ranges with practical implications for designing and tuning first-order optimization algorithms.
Abstract
We present a methodology for establishing the existence of quadratic Lyapunov inequalities for a wide range of first-order methods used to solve convex optimization problems. In particular, we consider i) classes of optimization problems of finite-sum form with (possibly strongly) convex and possibly smooth functional components, ii) first-order methods that can be written as a linear system in state-space form in feedback interconnection with the subdifferentials of the functional components of the objective function, and iii) quadratic Lyapunov inequalities that can be used to draw convergence conclusions. We present a necessary and sufficient condition for the existence of a quadratic Lyapunov inequality within a predefined class of Lyapunov inequalities, which amounts to solving a small-sized semidefinite program. We showcase our methodology on several first-order methods that fit the framework. Most notably, our methodology allows us to significantly extend the region of parameter choices that allow for duality-gap convergence in the Chambolle-Pock method when the linear operator is the identity mapping.
