Active set identification and rapid convergence for degenerate primal-dual problems
Mateo Díaz, Pedro Izquierdo Lehmann, Haihao Lu, Jinwen Yang
TL;DR
The paper addresses finite-time active-set identification for convex primal-dual problems solved by first-order methods, even in degenerate settings where strict complementarity fails. It introduces a meta-algorithm framework with a two-sequence scheme and metric-subregularity-based regularity, yielding nonasymptotic bounds for when the algorithm identifies the active set and then achieves linear convergence on a reduced, better-conditioned problem. The main contributions are explicit finite-time identification bounds depending on the local modulus $\alpha_L$ and a fast-linear convergence phase governed by $\alpha_{\mathcal{M}}$, applicable to PPM, PDHG, ADMM, and EGM. The results deliver practical, robust guarantees for two-stage convergence in a broad class of primal-dual algorithms, including linear and convex programs, and are corroborated by numerical experiments showing identification and rapid convergence even under degeneracy.
Abstract
Primal-dual methods for solving convex optimization problems with functional constraints often exhibit a distinct two-stage behavior. Initially, they converge towards a solution at a sublinear rate. Then, after a certain point, the method identifies the set of active constraints and the convergence enters a faster local linear regime. Theory characterizing this phenomenon spans over three decades. However, most existing work only guarantees eventual identification of the active set and relies heavily on nondegeneracy conditions, such as strict complementarity, which often fail to hold in practice. We characterize mild conditions on the problem geometry and the algorithm under which this phenomenon provably occurs. Our guarantees are entirely nonasymptotic and, importantly, do not rely on strict complementarity. Our framework encompasses several widely-used algorithms, including the proximal point method, the primal-dual hybrid gradient method, the alternating direction method of multipliers, and the extragradient method.
