Beyond Short Steps in Frank-Wolfe Algorithms
David Martínez-Rubio, Sebastian Pokutta
TL;DR
This paper addresses projection-free convex optimization by improving Frank-Wolfe methods through primal-dual analyses and novel strategies. It introduces an optimistic Frank-Wolfe algorithm and a generalized primal-dual short-step framework, providing computable stopping criteria via the primal-dual gap and convergence guarantees that extend to gradient descent. The key contributions include the optimistic FW with OFTRL/OMD updates, a flexible primal-dual short-step scheme with line-search options, and refined primal-dual convergence rates along with empirical demonstrations of practical advantages. The work advances both the theoretical understanding and the applicability of FW methods, enabling faster, stopping-criterion-driven optimization in large-scale, constraint-heavy scenarios and potentially benefiting broader gradient-descent-based methods. Overall, the proposed methods offer tighter dual bounds and improved adaptability to curvature while preserving projection-free updates.
Abstract
We introduce novel techniques to enhance Frank-Wolfe algorithms by leveraging function smoothness beyond traditional short steps. Our study focuses on Frank-Wolfe algorithms with step sizes that incorporate primal-dual guarantees, offering practical stopping criteria. We present a new Frank-Wolfe algorithm utilizing an optimistic framework and provide a primal-dual convergence proof. Additionally, we propose a generalized short-step strategy aimed at optimizing a computable primal-dual gap. Interestingly, this new generalized short-step strategy is also applicable to gradient descent algorithms beyond Frank-Wolfe methods. As a byproduct, our work revisits and refines primal-dual techniques for analyzing Frank-Wolfe algorithms, achieving tighter primal-dual convergence rates. Empirical results demonstrate that our optimistic algorithm outperforms existing methods, highlighting its practical advantages.
