Fast convergence of Frank-Wolfe algorithms on polytopes

Elias Wirth; Javier Pena; Sebastian Pokutta

Fast convergence of Frank-Wolfe algorithms on polytopes

Elias Wirth, Javier Pena, Sebastian Pokutta

TL;DR

This paper develops an affine-invariant template for deriving convergence rates of Frank-Wolfe variants on polytopes, relying on two core properties: extended curvature and Hölderian error bound. The framework yields rates that interpolate between sublinear and linear, controlled by the exponent $\theta \in (0,\tfrac{1}{2}]$, and it applies uniformly to vanilla FW, away-step FW, blended pairwise FW, and in-face FW. Convergence guarantees are expressed via problem-distance measures tied to geometry: radial distance $\mathfrak{r}$ for vanilla FW, vertex distance $\mathfrak{v}$ for AFW/BPFW, and face distance $\mathfrak{f}$ for IFW, with dimension-dependent refinements for standard-form polytopes and simplex-like polytopes. A key insight is that local facial geometry drives the error bounds, enabling dimension-independent rates and sharpening comparisons to global-width-based analyses, while recovering and extending prior results for simplex-like structures.

Abstract

We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended curvature. These properties depend solely on the polytope and objective function but not on any affine-dependent object like norms. For each one of the above algorithms, we derive rates of convergence ranging from sublinear to linear depending on the degree of the error bound.

Fast convergence of Frank-Wolfe algorithms on polytopes

TL;DR

Abstract

Fast convergence of Frank-Wolfe algorithms on polytopes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (44)