Fast convex optimization via closed-loop time scaling of gradient dynamics
Hedy Attouch, Radu Ioan Bot, Dang-Khoa Nguyen
TL;DR
The paper addresses fast convex optimization by designing adaptive accelerated gradient methods whose damping is controlled in a closed-loop fashion. It introduces time-scaling and averaging to produce autonomous damped inertial dynamics with vanishing viscous damping and Hessian-driven damping, yielding fast convergence of objective values, gradient decay, and convergence to minimizers; these continuous Dynamics inspire proximal discretizations and adaptive step-size rules. The authors derive general time-scale results, specialized to velocity- and gradient-based damping, and compare with Lin–Jordan, demonstrating improved convergence rates such as $f(x(t))-\,\inf f = o(1/t^{2p})$ for certain parameters, while keeping trajectory convergence guarantees. They then develop proximal-explicit discretizations (PEAS) and inertial proximal algorithms with adaptive stepsizes, provide a geometric interpretation, extend to nonsmooth settings via Moreau envelopes, and validate the approach with numerical experiments that show competitive performance and reduced oscillations. Overall, the work provides a flexible, theoretically grounded framework for autonomous, adaptive acceleration in convex optimization with potential extensions to composite, monotone, and stochastic settings.
Abstract
In a Hilbert setting, for convex differentiable optimization, we develop a general framework for adaptive accelerated gradient methods. They are based on damped inertial dynamics where the coefficients are designed in a closed-loop way. Specifically, the damping is a feedback control of the velocity, or of the gradient of the objective function. For this, we develop a closed-loop version of the time scaling and averaging technique introduced by the authors. We thus obtain autonomous inertial dynamics which involve vanishing viscous damping and implicit Hessian driven damping. By simply using the convergence rates for the continuous steepest descent and Jensen's inequality, without the need for further Lyapunov analysis, we show that the trajectories have several remarkable properties at once: they ensure fast convergence of values, fast convergence of the gradients towards zero, and they converge to optimal solutions. Our approach leads to parallel algorithmic results, that we study in the case of proximal algorithms. These are among the very first general results of this type obtained using autonomous dynamics. Since the proposed numerical methods are based on proximal techniques, the results can be extended to a broader class, specifically to the problem of minimizing a proper, lower semicontinuous, and convex function. Numerical experiments are conducted to demonstrate the efficiency of the proposed methods.
