Lower Bounds for Frank-Wolfe on Strongly Convex Sets
Jannis Halbey, Daniel Deza, Max Zimmer, Christophe Roux, Bartolomeo Stellato, Sebastian Pokutta
TL;DR
This work investigates whether strong convexity of the constraint set can yield faster convergence for the Frank-Wolfe (FW) algorithm beyond the known $\mathcal{O}(1/\sqrt{\varepsilon})$ rate. Focusing on the model problem of minimizing $f(x)=\|x-p\|_2^2$ over the unit ball with $\|p\|_2=1$, so that the optimizer lies on the boundary, the authors exploit a two-dimensional invariant subspace to analyze the FW dynamics via forward-backward trajectory construction. They develop a specialized backward reconstruction technique to synthesize long, high-precision worst-case trajectories and prove an $\Omega(1/\sqrt{\varepsilon})$ lower bound for exact line search and short steps, which extends to ellipsoids by affine invariance. The results demonstrate that the position of the optimizer materially affects convergence and that smoothness and strong convexity of the set alone do not enable a faster uniform rate, highlighting fundamental limits for projection-free methods in this geometric setting. These findings motivate exploring algorithmic variants tailored to strongly convex geometry that could potentially beat the bound while retaining the advantages of projection-free methods.
Abstract
We present a constructive lower bound of $\Omega(1/\sqrt{\varepsilon})$ for Frank-Wolfe (FW) when both the objective and the constraint set are smooth and strongly convex, showing that the known uniform $\mathcal{O}(1/\sqrt{\varepsilon})$ guarantees in this regime are tight. It is known that under additional assumptions on the position of the optimizer, FW can converge linearly. However, it remained unclear whether strong convexity of the set can yield rates uniformly faster than $\mathcal{O}(1/\sqrt{\varepsilon})$, i.e., irrespective of the position of the optimizer. To investigate this question, we focus on a simple yet representative problem class: minimizing a strongly convex quadratic over the Euclidean unit ball, with the optimizer on the boundary. We analyze the dynamics of FW for this problem in detail and develop a novel computational approach to construct worst-case FW trajectories, which is of independent interest. Guided by these constructions, we develop an analytical proof establishing the lower bound.
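To make the problem class concrete, the following is a minimal sketch of FW with exact line search on $f(x)=\|x-p\|_2^2$ over the Euclidean unit ball. It is an illustration under stated assumptions, not the authors' construction: the function name `frank_wolfe_ball`, the starting point, and the iteration count are hypothetical choices, and the sketch runs the plain forward iteration rather than the backward trajectory reconstruction used to build worst-case instances. For this particular quadratic (smoothness constant $L=2$), the short-step rule gives the same step size as exact line search.

```python
import numpy as np

def frank_wolfe_ball(p, x0, num_iters=200):
    """FW with exact line search for f(x) = ||x - p||_2^2 over the unit ball.

    Illustrative sketch; names and defaults are assumptions, not from the paper.
    Returns the final iterate and the list of objective values f(x_t).
    """
    p = np.asarray(p, dtype=float)
    x = np.asarray(x0, dtype=float)
    values = [float(np.sum((x - p) ** 2))]
    for _ in range(num_iters):
        grad = 2.0 * (x - p)
        gnorm = np.linalg.norm(grad)
        if gnorm == 0.0:          # already at the unconstrained minimizer
            break
        v = -grad / gnorm         # linear minimization oracle of the unit ball
        d = v - x                 # FW direction
        dd = float(d @ d)
        if dd == 0.0:             # x is already the FW vertex
            break
        # Exact line search for the quadratic, restricted to [0, 1].
        gamma = float(np.clip(-((x - p) @ d) / dd, 0.0, 1.0))
        x = x + gamma * d
        values.append(float(np.sum((x - p) ** 2)))
    return x, values

# Example: optimizer p on the boundary, start at a different boundary point.
p = np.array([1.0, 0.0])
x_final, values = frank_wolfe_ball(p, x0=[0.0, 1.0], num_iters=200)
```

Since the minimum value is $0$ at $x=p$, the entries of `values` are exactly the primal gaps, so they can be plotted against the iteration count to inspect the decay empirically for a given starting point.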
