Linear Convergence and Error Bounds for Optimization Without Strong Convexity
Kira van Treek, Javier F. Peña, Juan C. Vera, Luis F. Zuluaga
TL;DR
The paper addresses linear convergence of fixed-point iterations used to solve optimization problems without relying on strong convexity. It proves that linear convergence of averaged FPIs is equivalent to a local error-bound condition and shows that piecewise linear operators satisfy this bound, enabling practical linear rates for algorithms like ADMM and Douglas–Rachford. The results yield data-independent rates for linear optimization and rate bounds for quadratic optimization that depend only on the problem's condition number, via Hoffman-type constants. This framework provides a robust, general tool for analyzing convergence across LO, QO, and PLQ problems, with direct implications for common optimization algorithms and related fixed-point formulations.
Abstract
Many optimization algorithms$\unicode{x2013}$including gradient descent, proximal methods, and operator splitting techniques$\unicode{x2013}$can be formulated as fixed-point iterations (FPI) of continuous operators. When these operators are averaged, convergence to a fixed point is guaranteed when one exists, but the convergence is generally sublinear. Recent results establish linear convergence of FPI for averaged operators under certain conditions. However, such conditions do not apply to common classes of operators, such as those arising in piecewise linear and quadratic optimization problems. In this work, we prove that a local error-bound condition is both necessary and sufficient for the linear convergence of FPI applied to averaged operators. We provide explicit bounds on the convergence rate and show how these relate to the constants in the error-bound condition. Our main result demonstrates that piecewise linear operators satisfy local error bounds, ensuring linear convergence of the associated optimization algorithms. This leads to a general and practical framework for analyzing convergence behavior in algorithms such as ADMM and Douglas-Rachford in the absence of strong convexity. In particular, we obtain convergence rates that are independent of problem data for linear optimization, and depend only on the condition number of the objective for quadratic optimization.
