Gaps between Singular Values of Sample Covariance Matrices
Nicholas Christoffersen, Kyle Luh, Sean O'Rourke, Calum Shearer
TL;DR
The paper analyzes gaps between consecutive singular values of Σ^{1/2}M with M an n×p random matrix and Σ an n×n deterministic PD matrix, proving that small gaps are unlikely and that the spectrum is simple with high probability. The core methodology blends anti-concentration bounds, regularized LCD techniques, and ε-net arguments to control both compressible and incompressible singular vectors, enabling lower bounds on the squared-singular-value spacings that depend on the location in the spectrum. A principal contribution is resolving Vu's conjecture on simple singular value spectra for random matrices and deriving quantitative gap bounds that reflect edge vs. bulk behavior, consistent with Marchenko–Pastur-type limits. The results have practical implications for numerical linear algebra and graph isomorphism-related problems, including polynomial-time solvability of GI on most bipartite graphs via the non-repetition of singular values.
Abstract
We study the gaps between consecutive singular values of random rectangular matrices. Specifically, if $M$ is an $n \times p$ random matrix with independent and identically distributed entries and $Σ$ is a $n \times n$ deterministic positive definite matrix, then under some technical assumptions we give lower bounds for the gaps between consecutive singular values of $Σ^{1/2} M$. As a consequence, we show that sample covariance matrices have simple spectrum with high probability. Our results resolve a conjecture of Vu [{\em Probab. Surv.}, 18:179--200, 2021]. We also discuss some applications, including a bound on the spacings of eigenvalues of the adjacency matrix of random bipartite graphs.
