Min-Max Optimization Is Strictly Easier Than Variational Inequalities
Henry Shugart, Jason M. Altschuler
TL;DR
This work reveals a fundamental separation between convex-concave min-max optimization and its variational-inequality (VI) formulation in the unconstrained quadratic setting. By recasting convergence rates as extremal polynomial problems and exploiting the geometry of the relevant spectral sets (an interval for min-max versus a half-disc for VI), the authors prove that the optimal rate for min-max is faster: by a factor of roughly $3\sqrt{3}/4 \approx 1.3$ in the strongly-convex-strongly-concave case, and by about $3\sqrt{3}/2 \approx 2.6$ in the convex-concave case. The analysis uses Green's functions and conformal mappings to bound the extremal polynomials, and shows that asymmetric algorithms that treat the min and max variables differently (e.g., gradient descent-ascent with slingshot stepsizes) outperform symmetric VI approaches. An extension to adaptive algorithms shows that the gap persists even when the algorithm can adapt to observed data, via a duality-based construction of hard instances. Overall, the results motivate designing dedicated min-max algorithms rather than relying on VI reductions, with potential impact on a broad class of saddle-point problems.
Abstract
Classically, a mainstream approach for solving a convex-concave min-max problem is to instead solve the variational inequality problem arising from its first-order optimality conditions. Is it possible to solve min-max problems faster by bypassing this reduction? This paper initiates this investigation. We show that the answer is yes in the textbook setting of unconstrained quadratic objectives: the optimal convergence rate for first-order algorithms is strictly better for min-max problems than for the corresponding variational inequalities. The key reason that min-max algorithms can be faster is that they can exploit the asymmetry of the min and max variables, a property that is lost in the reduction to variational inequalities. Central to our analyses are sharp characterizations of optimal convergence rates in terms of extremal polynomials, which we compute using Green's functions and conformal mappings.
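To make the reduction to a variational inequality concrete, the following minimal sketch builds a random unconstrained quadratic saddle problem and compares two matrices: the symmetric Hessian of the objective, and the non-symmetric Jacobian of the VI operator $F(x, y) = (\nabla_x f, -\nabla_y f)$. Everything here is an illustrative assumption rather than the paper's construction: the parametrization $f(x, y) = \tfrac12 x^\top A x + x^\top B y - \tfrac12 y^\top C y$, the names `A`, `B`, `C`, `H`, `J`, the constants, and the plain gradient descent-ascent loop, which is a textbook baseline and not the slingshot-stepsize method mentioned in the summary.

```python
import numpy as np


def random_spd(k, lo, hi, rng):
    """Random symmetric matrix with eigenvalues drawn uniformly from [lo, hi]."""
    Q, _ = np.linalg.qr(rng.standard_normal((k, k)))
    return Q @ np.diag(rng.uniform(lo, hi, size=k)) @ Q.T


rng = np.random.default_rng(0)
n, m = 3, 3                # dimensions of x and y (illustrative)
mu, L_block = 0.5, 2.0     # eigenvalue range for the diagonal blocks (illustrative)

A = random_spd(n, mu, L_block, rng)   # curvature in x (strongly convex)
C = random_spd(m, mu, L_block, rng)   # curvature in y (strongly concave)
B = rng.standard_normal((n, m))       # bilinear coupling

# Assumed objective: f(x, y) = 0.5 x^T A x + x^T B y - 0.5 y^T C y
H = np.block([[A, B], [B.T, -C]])     # Hessian of f: symmetric, indefinite
J = np.block([[A, B], [-B.T, C]])     # Jacobian of F(z) = (grad_x f, -grad_y f)

eig_H = np.linalg.eigvalsh(H)         # real eigenvalues (H is symmetric)
eig_J = np.linalg.eigvals(J)          # generally complex eigenvalues
L_op = np.linalg.norm(J, 2)           # spectral norm of J

print("eigenvalues of H (real):", np.round(eig_H, 3))
print("eigenvalues of J       :", np.round(eig_J, 3))
print("min Re(eig J) =", eig_J.real.min(), " (mu =", mu, ")")
print("max |eig J|   =", np.abs(eig_J).max(), " (||J|| =", L_op, ")")

# Baseline: simultaneous gradient descent-ascent, z <- z - eta * F(z).
# With eta = mu / ||J||^2, each step contracts the distance to the saddle
# point z* = 0 by a factor of at most sqrt(1 - mu^2 / ||J||^2).
eta = mu / L_op**2
z0 = rng.standard_normal(n + m)
z = z0.copy()
for _ in range(2000):
    z = z - eta * (J @ z)

print("initial distance to saddle point:", np.linalg.norm(z0))
print("final distance to saddle point  :", np.linalg.norm(z))
```

The eigenvalues of `J` have real part at least `mu` (because the symmetric part of `J` is the block-diagonal matrix of `A` and `C`) and modulus at most `||J||`, so they sit in a half-disc-shaped region of the complex plane, while the eigenvalues of `H` are real. The sketch only illustrates this difference in spectral geometry; it does not reproduce the optimal rates or the algorithms analyzed in the paper.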
