Adaptive Preconditioned Gradient Descent with Energy
Hailiang Liu, Levon Nurbekyan, Xuping Tian, Yunan Yang
TL;DR
This paper addresses constrained optimization by integrating an energy-based adaptive step size with preconditioned gradient descent. The authors introduce the Adaptive Energy Preconditioned Gradient (AEPG) method, which extends the Adaptive Energy Gradient Descent framework to preconditioned directions, enabling unconditional energy stability and provable convergence rates for general, PL, and convex objectives. They develop and analyze two concrete instances, Hessian-Riemannian gradient descent (HRGD) and natural gradient descent (NGD), and unify HRGD with NGD under a Hessian metric, including a projection mechanism for linear equality constraints. Numerical results demonstrate that AEPG accelerates convergence over classical preconditioned methods, particularly for ill-conditioned or nonconvex problems, and show strong performance in D-optimal design and Wasserstein-based tasks, indicating broad practical impact for constrained optimization in engineering and ML contexts.
Abstract
We propose an adaptive step size with an energy approach for a suitable class of preconditioned gradient descent methods. We focus on settings where the preconditioning is applied to address the constraints in optimization problems, such as the Hessian-Riemannian and natural gradient descent methods. More specifically, we incorporate these preconditioned gradient descent algorithms in the recently introduced Adaptive Energy Gradient Descent (AEGD) framework. In particular, we discuss theoretical results on the unconditional energy-stability and convergence rates across three classes of objective functions. Furthermore, our numerical results demonstrate excellent performance of the proposed method on several test bed optimization problems.
