A Unified Model for High-Resolution ODEs: New Insights on Accelerated Methods
Hoomaan Maskan, Konstantinos C. Zygalakis, Armin Eftekhari, Alp Yurtsever
TL;DR
This work develops a unified high-resolution ODE framework for accelerated gradient methods by deriving a general HR-ODE from a forced Euler–Lagrange equation and analyzing it with integral quadratic constraints and Lyapunov methods. The framework recovers and tightens the continuous-time behavior of HB, NAG, TM, and QHM, and, via Semi-Implicit Euler discretization, yields a single discrete-time scheme that encompasses these methods while providing improved convergence guarantees, including an exact recovery of NAG and enhanced gradient-norm rates. It also connects rate-matching discretization to NAG, showing how NAG can arise as an accurate discretization of a rate-matching ODE, and offers sharper QHM and TM results within the same unified theory. The results advance theoretical understanding of acceleration in smooth convex and strongly convex settings and provide a versatile blueprint for designing accelerated methods with provable rates, while suggesting future work on non-Euclidean extensions and achieving TM’s best possible rates for its exact HR-ODE.
Abstract
Recent work on high-resolution ordinary differential equations (HR-ODEs) captures fine nuances among different momentum-based optimization methods, leading to accurate theoretical insights. However, these HR-ODEs often appear disconnected, each targeting a specific algorithm and derived with different assumptions and techniques. We present a unifying framework by showing that these diverse HR-ODEs emerge as special cases of a general HR-ODE derived using the Forced Euler-Lagrange equation. Discretizing this model recovers a wide range of optimization algorithms through different parameter choices. Using integral quadratic constraints, we also introduce a general Lyapunov function to analyze the convergence of the proposed HR-ODE and its discretizations, achieving significant improvements across various cases, including new guarantees for the triple momentum method$'$s HR-ODE and the quasi-hyperbolic momentum method, as well as faster gradient norm minimization rates for Nesterov$'$s accelerated gradient algorithm, among other advances.
