Beyond Expectation: Concentration Inequalities for Randomized Iterative Methods
Toby Anderson, Max Collins, Jamie Haddock, Jackie Lok, Elizaveta Rebrova
TL;DR
The paper addresses the near-worst-case behavior of stochastic iterative methods by deriving concentration and variance bounds for the error, beyond traditional expectation guarantees. It develops a tensor-lifting framework to bound higher-order moments of linear update errors $\mathbf{e}_k = \mathbf{Y}_k \mathbf{e}_{k-1}$, introducing key quantities $\mu = \|\mathbb{E}[ (\mathbf{Y}^T\mathbf{Y})^{\otimes 2}]\|$ and $\eta = \lambda_{\min}(\mathbb{E}[\mathbf{Y}^T\mathbf{Y}])$, which yield $\mathrm{Var}(\|\mathbf{e}_k\|^2) \le (\mu^k - \eta^k) \|\mathbf{e}_0\|^4$ and enable Chebyshev-type concentration and high-probability bounds. Specializing to randomized Kaczmarz and randomized Gauss-Seidel, the authors obtain explicit bounds in terms of singular values of $\mathbf{A}$, and they extend the analysis to nonlinear updates such as RK for linear inequalities. Complementary empirical results illustrate the bounds' behavior under varying conditioning and problem structure. These results provide confidence intervals and trajectory-wide probabilistic guarantees, informing algorithm design and diagnostic use for data-consistency and objective landscape issues in large-scale problems.
Abstract
Stochastic iterative methods are useful in a variety of large-scale numerical linear algebraic, machine learning, and statistical problems, in part due to their low-memory footprint. They are frequently used in a variety of applications, and thus it is imperative to have a thorough theoretical understanding of their behavior. Most theoretical convergence results for stochastic iterative methods provide bounds on the expected error of the iterates, and yield a type of average case analysis. However, understanding the behavior of these methods in the near-worst-case is desirable. For stochastic methods, this motivates providing bounds on the variance and concentration of their error, which can be used to generate confidence intervals around the bounds on their expected error. Here, we provide upper bounds for the concentration and variance of the error of a general class of linear stochastic iterative methods, including the randomized Kaczmarz method and the randomized Gauss--Seidel method, and a more general class of nonlinear stochastic iterative methods, including the randomized Kaczmarz method for systems of linear inequalities.
