Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

Elias Wirth; Javier Pena; Sebastian Pokutta

Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

Elias Wirth, Javier Pena, Sebastian Pokutta

TL;DR

All known non-affine-invariant convergence rates for Frank-Wolfe algorithm with open-loop step-sizes with affine invariance are extended to affine-Invariant results.

Abstract

Recent papers have shown that the Frank-Wolfe algorithm (FW) with open-loop step-sizes exhibits rates of convergence faster than the iconic $\mathcal{O}(t^{-1})$ rate. In particular, when the minimizer of a strongly convex function over a polytope lies in the relative interior of a feasible region face, the FW with open-loop step-sizes $η_t = \frac{\ell}{t+\ell}$ for $\ell \in \mathbb{N}_{\geq 2}$ has accelerated convergence $\mathcal{O}(t^{-2})$ in contrast to the rate $Ω(t^{-1-ε})$ attainable with more complex line-search or short-step step-sizes. Given the relevance of this scenario in data science problems, research has grown to explore the settings enabling acceleration in open-loop FW. However, despite FW's well-known affine invariance, existing acceleration results for open-loop FW are affine-dependent. This paper remedies this gap in the literature by merging two recent research trajectories: affine invariance (Wirth et al., 2023b) and open-loop step-sizes (Pena, 2021). In particular, we extend all known non-affine-invariant convergence rates for FW with open-loop step-sizes to affine-invariant results.

Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

TL;DR

All known non-affine-invariant convergence rates for Frank-Wolfe algorithm with open-loop step-sizes with affine invariance are extended to affine-Invariant results.

Abstract

Recent papers have shown that the Frank-Wolfe algorithm (FW) with open-loop step-sizes exhibits rates of convergence faster than the iconic

rate. In particular, when the minimizer of a strongly convex function over a polytope lies in the relative interior of a feasible region face, the FW with open-loop step-sizes

for

has accelerated convergence

in contrast to the rate

attainable with more complex line-search or short-step step-sizes. Given the relevance of this scenario in data science problems, research has grown to explore the settings enabling acceleration in open-loop FW. However, despite FW's well-known affine invariance, existing acceleration results for open-loop FW are affine-dependent. This paper remedies this gap in the literature by merging two recent research trajectories: affine invariance (Wirth et al., 2023b) and open-loop step-sizes (Pena, 2021). In particular, we extend all known non-affine-invariant convergence rates for FW with open-loop step-sizes to affine-invariant results.

Paper Structure (9 sections, 7 theorems, 35 equations, 1 algorithm)

This paper contains 9 sections, 7 theorems, 35 equations, 1 algorithm.

Introduction
Contributions
Paper outline
Preliminaries
Measures of optimality
Growth properties
Curvature of the feasible region and objective
Strong $(M, r)$-growth
Convergence rates

Key Result

Proposition 2.6

For $p\in]1, \infty[$, let ${\mathcal{C}}\xspace\subseteq \mathbb{R}^n$ be the $\ell_p$-ball. Then, the following holds:

Theorems & Definitions (19)

Definition 1.1: Affine covariance
Definition 2.1: Growth properties
Definition 2.2: Relaxed gaps growth property
Definition 2.3: ${\mathcal{C}}\xspace$-norm
Definition 2.4: Uniformly convex set
Proposition 2.6: hanner1956uniform
Definition 2.7: Smoothness
Proposition 2.8
proof
Definition 2.9: Hölderian error bound
...and 9 more

Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

TL;DR

Abstract

Accelerated Affine-Invariant Convergence Rates of the Frank-Wolfe Algorithm with Open-Loop Step-Sizes

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (19)