Dynamically accelerating the power iteration with momentum
Christian Austin, Sara Pollock, Yunrong Zhu
TL;DR
The paper addresses accelerating power and inverse power iterations without prior spectral information by introducing a dynamic momentum method that updates the momentum parameter $\beta_k$ at every iteration based on the Rayleigh quotient and two residuals, with no extra matrix-vector multiplies. It analyzes the convergence and stability of this approach via an augmented-matrix framework and proves asymptotic convergence to the dominant eigenpair, including acceleration in the symmetric case; it also reveals that the static optimal choice $\beta = \lambda_2^2/4$ yields a defective augmented matrix. Numerical experiments across multiple benchmark suites demonstrate that the dynamic method often outperforms both standard power iteration and static momentum, and it extends effectively to shifted inverse iterations. The work thereby provides a practical, low-cost accelerator for large-scale eigenvalue problems with broad applicability in computational science and machine learning.
Abstract
In this paper, we propose, analyze and demonstrate a dynamic momentum method to accelerate power and inverse power iterations with minimal computational overhead. The method can be applied to real diagonalizable matrices, is provably convergent with acceleration in the symmetric case, and does not require a priori spectral knowledge. We review and extend background results on previously developed static momentum accelerations for the power iteration through the connection between the momentum accelerated iteration and the standard power iteration applied to an augmented matrix. We show that the augmented matrix is defective for the optimal parameter choice. We then present our dynamic method which updates the momentum parameter at each iteration based on the Rayleigh quotient and two previous residuals. We present convergence and stability theory for the method by considering a power-like method consisting of multiplying an initial vector by a sequence of augmented matrices. We demonstrate the developed method on a number of benchmark problems, and see that it outperforms both the power iteration and often the static momentum acceleration with optimal parameter choice. Finally, we present and demonstrate an explicit extension of the algorithm to inverse power iterations.
