Preserving Extreme Singular Values with One Oblivious Sketch
John M. Mango, Ronald Katende
TL;DR
The paper investigates whether a single oblivious sketch can uniformly bound the largest and smallest nonzero singular values of every rank-$r$ matrix. It presents a constructive route that pairs a sparse embedding with a deterministic geometric balancing operator to achieve constant-factor control of all nonzero singular values under mild coherence or spectral-gap assumptions, yielding a perfectly conditioned sketch ($\kappa(SA)=1$). In contrast, it proves an oblivious lower bound showing that $s = O(r\log r)$ is insufficient to uniformly preserve both extremes for all rank-$r$ matrices, highlighting a fundamental gap between data-dependent balancing and fully oblivious sketches. Numerical experiments corroborate the theory: balancing improves conditioning and accelerates iterative solvers on well-behaved data, while adversarial/coherent inputs reveal the predicted limits. Overall, the work delineates the boundary between achievable extreme-value preservation via data-dependent techniques and the inherent limitations of oblivious sketches for conditioning-sensitive tasks in numerical linear algebra.
Abstract
We study when a single linear sketch can control the largest and smallest nonzero singular values of every rank-$r$ matrix. Classical oblivious embeddings require $s=Θ(r/\varepsilon^{2})$ for $(1\pm\varepsilon)$ distortion, but this does not yield constant-factor control of extreme singular values or condition numbers. We formalize a conjecture that $s=O(r\log r)$ suffices for such preservation. On the constructive side, we show that combining a sparse oblivious sketch with a deterministic geometric balancing map produces a sketch whose nonzero singular values collapse to a common scale under bounded condition number and coherence. On the negative side, we prove that any oblivious sketch achieving relative $\varepsilon$-accurate singular values for all rank-$r$ matrices must satisfy $s=Ω((r+\log(1/δ))/\varepsilon^{2})$. Numerical experiments on structured matrix families confirm that balancing improves conditioning and accelerates iterative solvers, while coherent or nearly rank-deficient inputs manifest the predicted failure modes.
