A Lyapunov analysis of Korpelevich's extragradient method with fast and flexible extensions
Manu Upadhyaya, Puya Latafat, Pontus Giselsson
TL;DR
The paper addresses solving monotone inclusions of the form $0 \in F(z) + \partial g(z)$ by introducing a Lyapunov-based analysis for Korpelevich’s extragradient method, yielding a last-iterate $o(1/k)$ convergence rate for a unifying Lyapunov function that upper-bounds common optimality measures. It then designs three FLEX-type algorithms that blend standard extragradient steps with user-specified directions under Lyapunov-guided line-search, guaranteeing global convergence and enabling superlinear rates when directions are well-chosen (e.g., quasi-Newton, Anderson acceleration). Theoretical results establish descent, quasi-Fejér properties, and convergence for the three variants (FLEX, I-FLEX, Prox-FLEX), with additional results under strong monotonicity and injectivity. Numerical experiments across quadratic minimax problems, bilinear games, Cournot–Nash equilibria, and sparse logistic regression demonstrate the practical effectiveness and simplicity of the approach, particularly when leveraging accelerated directions like Anderson or J-symmetric updates. Overall, the work broadens extragradient analysis to include Lyapunov-based globalization and flexible direction strategies, offering practical globalization tools for fast local directions in monotone inclusion problems.
Abstract
We develop a Lyapunov-based analysis of Korpelevich's extragradient method and show that it achieves an $o(1/k)$ last-iterate convergence rate of the constructed Lyapunov function. This Lyapunov function simultaneously upper bounds several standard measures of optimality, which allows our analysis to sharpen existing last-iterate convergence guarantees for these measures. Moreover, the same analysis enables the design of a class of flexible extensions of the extragradient method in which extragradient steps are adaptively blended with user-specified directions via a Lyapunov-guided line-search procedure. These extensions retain global convergence under practical assumptions and can attain superlinear rates when the directions are chosen appropriately. Numerical experiments confirm the simplicity and efficiency of the proposed framework.
