Randomly Pivoted Partial Cholesky: Random How?
Stefan Steinerberger
TL;DR
The paper addresses efficient low-rank approximation of symmetric positive-definite matrices using randomly pivoted partial Cholesky. It analyzes pivot strategies under a Gibbs-like scheme with exponent $\beta$, proving that sampling with probability proportional to $A_{ii}^2$ yields contraction in the Frobenius norm: $\mathbb{E}\|B\|_F^2 \le \|A\|_F^2 - \frac{1}{\sum_i A_{ii}^2}\sum_i \|A_i\|_2^4 \le (1 - 1/n)\|A\|_F^2$, with the bound sharp for diagonal $A$ and strengthened by column-norm fluctuations. The work also connects to greedy pivoting, provides bounds that justify large diagonal choices, and proposes alternating pivoting between greedy and random strategies to hedge against misleading diagonal cues. Through preparatory computations and a key inequality $A_{ii}(A^3)_{ii} \ge \|A_i\|_2^4$, the authors derive rigorous proofs of the main theorem and corollaries, situating the results within the broader literature on randomized low-rank matrix approximations and kernel methods. The findings offer practical guidance for pivot selection in large-scale SPD matrices where entry evaluations are costly. $ $
Abstract
We consider the problem of finding good low rank approximations of symmetric, positive-definite $A \in \mathbb{R}^{n \times n}$. Chen-Epperly-Tropp-Webber showed, among many other things, that the randomly pivoted partial Cholesky algorithm that chooses the $i-$th row with probability proportional to the diagonal entry $A_{ii}$ leads to a universal contraction of the trace norm (the Schatten 1-norm) in expectation for each step. We show that if one chooses the $i-$th row with likelihood proportional to $A_{ii}^2$ one obtains the same result in the Frobenius norm (the Schatten 2-norm). Implications for the greedy pivoting rule and pivot selection strategies are discussed.
