A Homogeneous Second-Order Descent Method for Nonconvex Optimization
Chuwen Zhang, Dongdong Ge, Chang He, Bo Jiang, Yuntian Jiang, Chenyu Xue, Yinyu Ye
TL;DR
This work introduces the Homogeneous Second-Order Descent Method (HSODM) for nonconvex optimization by solving a homogenized quadratic model that reduces each step to an eigenvalue problem. By focusing on the leftmost eigenpair of the homogenized matrix, HSODM delivers a simple, single-loop algorithm with an $O(\epsilon^{-3/2})$ global convergence rate to an $\epsilon$-approximate SOSP and a local quadratic rate when near a SOSP. A key innovation is the use of a perturbation parameter $\delta$ and a fixed-radius or line-search strategy to ensure descent, plus an inexact variant using a Lanczos solver with a skewed initialization to maintain robust convergence in large-scale settings. Numerical results on the CUTEst benchmark show HSODM and its Hessian-vector product variant outperform standard second-order methods in several metrics, underscoring its practical appeal for nonconvex optimization.
Abstract
In this paper, we introduce a Homogeneous Second-Order Descent Method (HSODM) using the homogenized quadratic approximation to the original function. The merit of homogenization is that only the leftmost eigenvector of a gradient-Hessian integrated matrix is computed at each iteration. Therefore, the algorithm is a single-loop method that does not need to switch to other sophisticated algorithms and is easy to implement. We show that HSODM has a global convergence rate of $O(ε^{-3/2})$ to find an $ε$-approximate second-order stationary point, and has a local quadratic convergence rate under the standard assumptions. The numerical results demonstrate the advantage of the proposed method over other second-order methods.
