Mini-Extragradient Methods
Xiaozhi Liu, Yong Xia
TL;DR
This work addresses solving monotone nonlinear equations with Extragradient (EG) methods by removing dependence on the global Lipschitz constant $L$ for stepsize and by reducing per-iteration cost from two full mappings to coordinate-wise updates. It introduces three Mini-EG variants—Greedy Mini-EG, Random Mini-EG, and Watchdog-Max—that rely on componentwise Lipschitz constants $l_i$ and update only one (or two, for Watchdog-Max) coordinates per step, while preserving convergence guarantees. Theoretical results establish ergodic and/or expected convergence rates, with Greedy Mini-EG offering sharper rates in standard settings and Watchdog-Max achieving substantial practical speedups. Empirical tests on regularized decentralized logistic regression and compressed sensing show speedups up to and beyond $13\times$ compared with classical EG, highlighting significant gains in computational efficiency for large-scale problems.
Abstract
The Extragradient (EG) method stands as a cornerstone algorithm for solving monotone nonlinear equations but faces two important unresolved challenges: (i) how to select stepsizes without relying on the global Lipschitz constant or expensive line-search procedures, and (ii) how to reduce the two full evaluations of the mapping required per iteration to effectively one, without compromising convergence guarantees or computational efficiency. To address the first challenge, we propose the Greedy Mini-Extragradient (Mini-EG) method, which updates only the coordinate associated with the dominant component of the mapping at each extragradient step. This design capitalizes on componentwise Lipschitz constants that are far easier to estimate than the classical global Lipschitz constant. To further lower computational cost, we introduce a Random Mini-EG variant that replaces full mapping evaluations by sampling only a single coordinate per extragradient step. Although this resolves the second challenge from a theoretical standpoint, its practical efficiency remains limited. To bridge this gap, we develop the Watchdog-Max strategy, motivated by the slow decay of dominant component magnitudes. Instead of evaluating the full mapping, Watchdog-Max identifies and tracks only two coordinates at each extragradient step, dramatically reducing per-iteration cost while retaining strong practical performance. We establish convergence guarantees and rate analyses for all proposed methods. In particular, Greedy Mini-EG achieves enhanced convergence rates that surpass the classical guarantees of the vanilla EG method in several standard application settings. Numerical experiments on regularized decentralized logistic regression and compressed sensing show speedups exceeding $13\times$ compared with the classical EG method on both synthetic and real datasets.
