Revisiting Frank-Wolfe for Structured Nonconvex Optimization
Hoomaan Maskan, Yikun Hou, Suvrit Sra, Alp Yurtsever
TL;DR
The paper addresses nonconvex optimization expressed as a difference of convex functions under a projection-free setting. It introduces Dc-Fw, a framework that couples the DC Algorithm with Frank-Wolfe, and analyzes two natural DC decompositions that yield distinct variants, including a gradient-efficient CGS-style method and an inexact proximal-point method. Theoretical results show first-order stationarity in 𝒪(1/ε^2) FW steps and, for suitably structured domains, improved gradient and LMO complexities, with empirical validation on quadratic assignment and partially observed embedding alignment. The work demonstrates how problem reformulation via DC decompositions can meaningfully enhance projection-free optimization, and points to stochastic and adaptive decomposition extensions as promising directions.
Abstract
We introduce a new projection-free (Frank-Wolfe) method for optimizing structured nonconvex functions that are expressed as a difference of two convex functions. This problem class subsumes smooth nonconvex minimization, positioning our method as a promising alternative to the classical Frank-Wolfe algorithm. DC decompositions are not unique; by carefully selecting a decomposition, we can better exploit the problem structure, improve computational efficiency, and adapt to the underlying problem geometry to find better local solutions. We prove that the proposed method achieves a first-order stationary point in $O(1/ε^2)$ iterations, matching the complexity of the standard Frank-Wolfe algorithm for smooth nonconvex minimization in general. Specific decompositions can, for instance, yield a gradient-efficient variant that requires only $O(1/ε)$ calls to the gradient oracle. Finally, we present numerical experiments demonstrating the effectiveness of the proposed method compared to other projection-free algorithms.
