A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
Tatjana Chavdarova, Tong Yang, Matteo Pagliardini, Michael I. Jordan
TL;DR
The paper develops and analyzes a primal-dual approach (ACVI) for solving variational inequalities under general constraints, removing the need for exact subproblem solutions via a warm-started inexact variant (I-ACVI). It establishes nonasymptotic last-iterate convergence rates of $O(1/\\sqrt{K})$ for monotone VIs without assuming $L$-Lipschitzness, and shows that I-ACVI preserves this rate under suitable error decay. A projection-free specialization (P-ACVI) is introduced for simple inequality constraints, preserving the same rate, and a projection-based variant (PI-ACVI) is analyzed for efficiency. The paper provides extensive experiments on 2D and high-dimensional games and a constrained GAN on MNIST, demonstrating faster wall-clock convergence with warm-starting and illustrating practical gains over projection-based baselines. Overall, these results advance last-iterate guarantees for VI solvers with general constraints and practical, scalable algorithms for large-scale problems.
Abstract
Yang et al. (2023) recently showed how to use first-order gradient methods to solve general variational inequalities (VIs) under a limiting assumption that analytic solutions of specific subproblems are available. In this paper, we circumvent this assumption via a warm-starting technique where we solve subproblems approximately and initialize variables with the approximate solution found at the previous iteration. We prove the convergence of this method and show that the gap function of the last iterate of the method decreases at a rate of $O(\frac{1}{\sqrt{K}})$ when the operator is $L$-Lipschitz and monotone. In numerical experiments, we show that this technique can converge much faster than its exact counterpart. Furthermore, for the cases when the inequality constraints are simple, we introduce an alternative variant of ACVI and establish its convergence under the same conditions. Finally, we relax the smoothness assumptions in Yang et al., yielding, to our knowledge, the first convergence result for VIs with general constraints that does not rely on the assumption that the operator is $L$-Lipschitz.
