Continuized Nesterov Acceleration for Non-Convex Optimization
Julien Hermant, Jean-François Aujol, Charles Dossal, Lorick Huang, Aude Rondepierre
TL;DR
This work extends the continuized Nesterov framework to non-convex optimization by incorporating non-smooth Lyapunov functions and strengthening trajectory-wise guarantees. The authors develop an Itô-style tool for non-smooth Lyapunov analysis, derive sharp high-probability and almost-sure bounds, and improve convergence rates for strongly quasar-convex functions while relaxing minimizer-uniqueness assumptions. The method combines continuous-time Lyapunov analysis with exact discrete realizations of a Nesterov momentum scheme driven by random jump times, enabling a tight transfer of convergence properties from the continuous to the discrete setting. Compared to classical ODE-based treatments, the continuized approach yields robust, trajectory-wise guarantees without requiring subroutines or strict convexity, thus offering practical and theoretical advances for non-convex first-order optimization with momentum. Overall, the results tighten constants in convergence rates and broaden applicability to non-smooth, non-convex landscapes, with explicit high-probability and almost-sure convergence statements grounded in stochastic calculus.
Abstract
In convex optimization, continuous-time counterparts have been a fruitful tool for analyzing momentum algorithms. Fewer such examples are available when the function to minimize is non-convex. In several cases, discrepancies arise between the existing discrete-time results, namely those obtained for momentum algorithms, and their continuous-time counterparts, with the latter typically yielding stronger guarantees. We argue that the continuized framework (Even et al., 2021), mixing continuous and discrete components, can tighten the gap between known continuous and discrete results. This framework relies on computations akin to standard Lyapunov analyses, from which are deduced convergence bounds for an algorithm that can be written as a Nesterov momentum algorithm with stochastic parameters. In this work, we extend the range of applicability of the continuized framework, e.g. by allowing it to handle non-smooth Lyapunov functions. We then strengthen its trajectory-wise guarantees for linear convergence rate, deriving finite time bounds with high probability and asymptotic almost sure bounds. We apply this framework to the non-convex class of strongly quasar convex functions. Adapting continuous-time results that have weaker discrete equivalents to the continuized method, we improve by a constant factor the known convergence rate, and relax the existing assumptions on the set of minimizers.
