Linear Last-iterate Convergence in Constrained Saddle-point Optimization
Chen-Yu Wei, Chung-Wei Lee, Mengxiao Zhang, Haipeng Luo
TL;DR
This work advances the understanding of last-iterate convergence for constrained saddle-point optimization by analyzing OGDA and OMWU on bilinear and general constrained problems, notably over probability simplices.A key contribution is proving linear last-iterate convergence for OMWU with a universal constant learning rate under a unique equilibrium, and introducing the Saddle-Point Metric Subregularity (SP-MS) condition under which OGDA achieves concrete linear rates (or sublinear rates when SP-MS holds with beta>0).The results show that bilinear games over polytopes satisfy SP-MS, yielding exponential convergence for OGDA without requiring a unique equilibrium, while strongly convex-strongly concave cases are encompassed and aligned with prior work; the paper also demonstrates that non-polytope curved sets can break linear convergence.Empirical tests on matrix games support the theory, indicating OGDA often outperforms OMWU in last-iterate convergence and illustrating the role of feasible-set geometry in convergence behavior.
Abstract
Optimistic Gradient Descent Ascent (OGDA) and Optimistic Multiplicative Weights Update (OMWU) for saddle-point optimization have received growing attention due to their favorable last-iterate convergence. However, their behaviors for simple bilinear games over the probability simplex are still not fully understood - previous analysis lacks explicit convergence rates, only applies to an exponentially small learning rate, or requires additional assumptions such as the uniqueness of the optimal solution. In this work, we significantly expand the understanding of last-iterate convergence for OGDA and OMWU in the constrained setting. Specifically, for OMWU in bilinear games over the simplex, we show that when the equilibrium is unique, linear last-iterate convergence is achieved with a learning rate whose value is set to a universal constant, improving the result of (Daskalakis & Panageas, 2019b) under the same assumption. We then significantly extend the results to more general objectives and feasible sets for the projected OGDA algorithm, by introducing a sufficient condition under which OGDA exhibits concrete last-iterate convergence rates with a constant learning rate whose value only depends on the smoothness of the objective function. We show that bilinear games over any polytope satisfy this condition and OGDA converges exponentially fast even without the unique equilibrium assumption. Our condition also holds for strongly-convex-strongly-concave functions, recovering the result of (Hsieh et al., 2019). Finally, we provide experimental results to further support our theory.
