Scalable Frank-Wolfe on Generalized Self-concordant Functions via Simple Steps
Alejandro Carderera, Mathieu Besançon, Sebastian Pokutta
TL;DR
This work provides a projection-free Frank-Wolfe framework for generalized self-concordant objectives, proving that a simple open-loop step $\gamma_t=2/(t+2)$ yields $\mathcal{O}(1/t)$ convergence for both the primal and Frank-Wolfe gaps without line searches or higher-order information. It further shows accelerated rates in practical settings: linear convergence when the optimum lies in the interior and when the feasible set is uniformly or strongly convex, with uniform convexity yielding rates that depend on the degree $q$, and polytopes enabling linear rates for AFW and BPCG with backtracking. Comprehensive experiments on portfolio optimization, logistic regression, and the Birkhoff polytope demonstrate the method’s competitive performance and numerical robustness, including in ill-conditioned scenarios, while a stateless variant offers a simpler update rule with comparable guarantees. Overall, the paper advances scalable, parameter-free, projection-free optimization for a broad class of self-concordant-like objectives and clarifies the practical implications of step-size strategies and geometric assumptions on convergence rates.
Abstract
Generalized self-concordance is a key property present in the objective function of many important learning problems. We establish the convergence rate of a simple Frank-Wolfe variant that uses the open-loop step size strategy $γ_t = 2/(t+2)$, obtaining a $\mathcal{O}(1/t)$ convergence rate for this class of functions in terms of primal gap and Frank-Wolfe gap, where $t$ is the iteration count. This avoids the use of second-order information or the need to estimate local smoothness parameters of previous work. We also show improved convergence rates for various common cases, e.g., when the feasible region under consideration is uniformly convex or polyhedral.
