Randomly sparsified Richardson iteration: A dimension-independent sparse linear solver

Jonathan Weare; Robert J. Webber

Randomly sparsified Richardson iteration: A dimension-independent sparse linear solver

Jonathan Weare, Robert J. Webber

TL;DR

This work introduces randomly sparsified Richardson iteration (RSRI), a dimension-independent sparse linear solver that combines Richardson fixed-point iterations with random sparsification to solve $A x=b$ using at most $m$ nonzeros per update. The authors provide a complete mathematical analysis showing faster-than-Monte Carlo convergence, with explicit bias-variance bounds and conditions under which RSRI converges, even when the solution vector is too large to store densely. The PageRank problem is used as a primary application, where RSRI achieves improved variance decay and demonstrates practical scalability to very large graphs, aided by the pivotal sparsification design. By situating RSRI relative to Monte Carlo methods, SGD, deterministic sparse methods, and fast randomized iteration, the paper establishes a principled framework for sparsified fixed-point solvers with provable performance, including near-optimal triple-norm error behavior and explicit guidelines for choosing sparsity $m$.

Abstract

Recently, a class of algorithms combining classical fixed point iterations with repeated random sparsification of approximate solution vectors has been successfully applied to eigenproblems with matrices as large as $10^{108} \times 10^{108}$. So far, a complete mathematical explanation for their success has proven elusive. The family of methods has not yet been extended to the important case of linear system solves. In this paper we propose a new scheme based on repeated random sparsification that is capable of solving sparse linear systems in arbitrarily high dimensions. We provide a complete mathematical analysis of this new algorithm. Our analysis establishes a faster-than-Monte Carlo convergence rate and justifies use of the scheme even when the solution vector itself is too large to store.

Randomly sparsified Richardson iteration: A dimension-independent sparse linear solver

TL;DR

This work introduces randomly sparsified Richardson iteration (RSRI), a dimension-independent sparse linear solver that combines Richardson fixed-point iterations with random sparsification to solve

using at most

nonzeros per update. The authors provide a complete mathematical analysis showing faster-than-Monte Carlo convergence, with explicit bias-variance bounds and conditions under which RSRI converges, even when the solution vector is too large to store densely. The PageRank problem is used as a primary application, where RSRI achieves improved variance decay and demonstrates practical scalability to very large graphs, aided by the pivotal sparsification design. By situating RSRI relative to Monte Carlo methods, SGD, deterministic sparse methods, and fast randomized iteration, the paper establishes a principled framework for sparsified fixed-point solvers with provable performance, including near-optimal triple-norm error behavior and explicit guidelines for choosing sparsity

Abstract

. So far, a complete mathematical explanation for their success has proven elusive. The family of methods has not yet been extended to the important case of linear system solves. In this paper we propose a new scheme based on repeated random sparsification that is capable of solving sparse linear systems in arbitrarily high dimensions. We provide a complete mathematical analysis of this new algorithm. Our analysis establishes a faster-than-Monte Carlo convergence rate and justifies use of the scheme even when the solution vector itself is too large to store.

Paper Structure (21 sections, 15 theorems, 127 equations, 2 figures, 4 algorithms)

This paper contains 21 sections, 15 theorems, 127 equations, 2 figures, 4 algorithms.

Introduction
Plan for paper
Notation
Main error bound for RSRI
The PageRank problem
History of Monte Carlo PageRank algorithms
Faster PageRank by RSRI
Empirical tests
Related algorithms
The Monte Carlo method for linear systems
Stochastic gradient descent
Deterministic sparse approximation
Fast randomized iteration
Random sparsification: design and analysis
Pseudocode
...and 6 more sections

Key Result

Theorem 2.1

\newlabelthm:main0 Suppose RSRI with sparsity level $m$ is applied to an $n \times n$ linear system $\bm{A} \bm{x} = \bm{b}$ for which $\bm{G} = \mathbf{I} - \bm{A}$ is a strict $1$-norm contraction: RSRI returns a solution $\overline{\bm{x}}_t$ satisfying the bias-variance formula Here the expectation averages over the random set of entries rounded to zero at each sparsification step. The squa

Figures (2)

Figure 1: (RSRI error scaling.) Left panel shows tail decay $\sum\nolimits_{i = m}^n \bm{x}^{\downarrow}(i)$ of the sorted PageRank solution $\bm{x}^{\downarrow}$ for three personalized PageRank problems documented in \ref{['sec:empirical']}. Right panel shows RSRI error $(\mathbb{E} \lVert \hat{\bm{x}} - \bm{x}_\star \rVert^2)^{1/2}$ with sparsity level $m$. The right panel is essentially the left panel multiplied by a factor of $m^{-1/2}$
Figure 1: (Slow convergence of coordinate descent). Error $\lVert \hat{\bm{x}} - \bm{x} \rVert$ and residual error $\lVert \bm{r}(\hat{\bm{x}}) \rVert_{\infty}$ for coordinate descent with $t$ update steps, compared to theoretical $t^{-1}$ scaling. In the figure, we apply coordinate descent to the Amazon electronics PageRank problem, as documented in \ref{['sec:empirical']}.

Theorems & Definitions (27)

Theorem 2.1: Main error bound
Corollary 2.2: Fast polynomial or exponential convergence
Proof 1
Proposition 3.1: PageRank error bound
Lemma 3.2: Decay of entries in the PageRank vector
Proof 2: Proof of \ref{['lem:decay_rate']}
Theorem 5.1: Near-optimal error
Proposition 5.2: Optimal $L^2$ error
Proof 3: Proof of \ref{['prop:relaxed']}
Lemma 5.3: Difference in norms
...and 17 more

Randomly sparsified Richardson iteration: A dimension-independent sparse linear solver

TL;DR

Abstract

Randomly sparsified Richardson iteration: A dimension-independent sparse linear solver

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (27)