Fast convergence of Frank-Wolfe algorithms on polytopes
Elias Wirth, Javier Pena, Sebastian Pokutta
TL;DR
This paper develops an affine-invariant template for deriving convergence rates of Frank-Wolfe variants on polytopes, relying on two core properties: extended curvature and Hölderian error bound. The framework yields rates that interpolate between sublinear and linear, controlled by the exponent $\theta \in (0,\tfrac{1}{2}]$, and it applies uniformly to vanilla FW, away-step FW, blended pairwise FW, and in-face FW. Convergence guarantees are expressed via problem-distance measures tied to geometry: radial distance $\mathfrak{r}$ for vanilla FW, vertex distance $\mathfrak{v}$ for AFW/BPFW, and face distance $\mathfrak{f}$ for IFW, with dimension-dependent refinements for standard-form polytopes and simplex-like polytopes. A key insight is that local facial geometry drives the error bounds, enabling dimension-independent rates and sharpening comparisons to global-width-based analyses, while recovering and extending prior results for simplex-like structures.
Abstract
We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended curvature. These properties depend solely on the polytope and objective function but not on any affine-dependent object like norms. For each one of the above algorithms, we derive rates of convergence ranging from sublinear to linear depending on the degree of the error bound.
