Randomized Kaczmarz with geometrically smoothed momentum
Seth J. Alderman, Roan W. Luikart, Nicholas F. Marshall
TL;DR
The paper introduces the randomized Kaczmarz with geometrically smoothed momentum (KGSM) to solve linear least-squares problems and proves a directional convergence result for the error along singular vectors, extending prior work on directional decay. KGSM updates incorporate momentum that is geometrically smoothed by a parameter $\beta$ and a momentum factor $M$, yielding closed-form behavior for $\mathbb{E}\langle x_{k+1}-x, v_l\rangle$ in terms of $r=1-\sigma_l^2/\|A\|_F^2+M(1-\beta)$ and $\zeta=M(1-\beta)^2$. The authors show that for $M$ and $\beta$ in suitable ranges, KGSM can accelerate convergence in directions associated with small singular values, recover standard Kaczmarz when $M=0$, and optimize smoothing by selecting $\beta$; a rich set of numerical experiments illustrates the dynamics, including complex eigenvalue regimes and spiking behavior. The work motivates further exploration of adaptive parameter selection, extensions to Nesterov-like schemes, and block methods, with implications for faster linear-system solvers and insight into momentum-based stochastic optimization in linear settings.
Abstract
This paper studies the effect of adding geometrically smoothed momentum to the randomized Kaczmarz algorithm, which is an instance of stochastic gradient descent on a linear least squares loss function. We prove a result about the expected error in the direction of singular vectors of the matrix defining the least squares loss. We present several numerical examples illustrating the utility of our result and pose several questions.
