An Adaptive Regularized Proximal Newton-Type Methods for Composite Optimization over the Stiefel Manifold
Qinsi Wang, Wei Hong Yang
TL;DR
This work develops two adaptive regularized Newton-type methods for composite optimization on the Stiefel manifold: ARPQN, a proximal quasi-Newton method with adaptive quadratic regularization, and ARPN, a proximal Newton variant using retreated Newton steps. ARPQN achieves global convergence with a guaranteed local linear rate, while ARPN attains global convergence with local superlinear convergence under standard smoothness and Hessian assumptions; both employ line-searched retractions and LBFGS-based Hessian approximations to solve tangent-space subproblems, with ASSN as a practical solver for the subproblems. Empirical results on compressed modes and sparse PCA demonstrate that ARPQN, particularly with SVD retraction, often outperforms existing proximal-Newton-type methods in iteration count and runtime, whereas ARPN shows superior theoretical convergence properties at higher computational cost. The work contributes a rigorous convergence and complexity analysis for manifold-based regularized Newton methods and validates their practical effectiveness in non-Euclidean composite optimization.
Abstract
Recently, the proximal Newton-type method and its variants have been generalized to solve composite optimization problems over the Stiefel manifold whose objective function is the summation of a smooth function and a nonsmooth function. In this paper, we propose an adaptive quadratically regularized proximal quasi-Newton method, named ARPQN, to solve this class of problems. Under some mild assumptions, the global convergence, the local linear convergence rate and the iteration complexity of ARPQN are established. Numerical experiments and comparisons with other state-of-the-art methods indicate that ARPQN is very promising. We also propose an adaptive quadratically regularized proximal Newton method, named ARPN. It is shown the ARPN method has a local superlinear convergence rate under certain reasonable assumptions, which demonstrates attractive convergence properties of regularized proximal Newton methods.
