On Quantile Randomized Kaczmarz for Linear Systems with Time-Varying Noise and Corruption
Nestor Coria, Jamie Haddock, Jaime Pacheco
TL;DR
This work studies solving large-scale linear systems $A{\bm{x}}={\bm{b}}$ when measurements are affected by time-varying noise ${\bm{n}}^{(k)}$ and corruption ${\bm{c}}^{(k)}$, observed as ${\bm{b}}^{(k)}={\bm{b}}+{\bm{n}}^{(k)}+{\bm{c}}^{(k)}$. It extends the quantile randomized Kaczmarz (QRK) method, which updates the iterate only when the residual entry lies below a quantile-based threshold, to time-varying perturbations via two practical implementations (QRK1 and QRK2). The authors prove a linear-in-k convergence bound in expectation up to a convergence horizon, characterized by a rate factor $(1-p\varphi)$ and a noise-dependent horizon term $p\gamma_k$, and they provide a Markov-inequality-based lower bound showing that the largest residual entries reveal the corrupted indices with high probability. They also derive corollaries for bounded noise, general noise with known distribution, and Gaussian noise, and validate the theory with extensive numerical experiments demonstrating robustness to time-varying corruption and detection capabilities. These results advance robust, scalable linear-algebra solvers in streaming/distributed settings and enable corruption-detection using residuals.
Abstract
Large-scale systems of linear equations arise in machine learning, medical imaging, sensor networks, and in many areas of data science. When the scale of the systems are extreme, it is common for a fraction of the data or measurements to be corrupted. The Quantile Randomized Kaczmarz (QRK) method is known to converge on large-scale systems of linear equations $A\mathbf{x}=\mathbf{b}$ that are inconsistent due to static corruptions in the measurement vector $\mathbf{b}$. We prove that QRK converges even for systems corrupted by time-varying perturbations. Additionally, we prove that QRK converges up to a convergence horizon on systems affected by time-varying noise and corruption. Finally, we utilize Markov's inequality to prove a lower bound on the probability that the largest entries of the QRK residual reveal the time-varying corruption in each iteration. We present numerical experiments which illustrate our theoretical results.
