Optimal Oblivious Subspace Embeddings with Near-optimal Sparsity
Shabarish Chenakkod, Michał Dereziński, Xiaoyu Dong
TL;DR
The paper resolves a central question in randomized linear algebra by constructing an oblivious subspace embedding with optimal dimension $m=\Theta(d/\varepsilon^2)$ while achieving near-optimal sparsity $s=\tilde O(1/\varepsilon)$ per column, nearly matching the Nelson–Nguyen sparsity conjecture. It introduces LESS-IC, a leverage-score–aware sparse embedding that attains these guarantees with fast application times, and extends the approach to LESS-IE and OSE-IE variants for data-aware or independent-entry models. A novel analytical framework combining a decoupling technique with a two-dimensional interpolation (via chain-rule derivatives) bounds trace moments of embedding errors, yielding sharp $1/\varepsilon$ sparsity while preserving subspace geometry with high probability. The contributions include fast subspace-embedding constructions, improved runtimes for matrix approximation and regression tasks, and a versatile universality toolkit that may influence future random-matrix analyses. Overall, the results significantly advance practical dimensionality reduction by achieving optimal dimension alongside practical sparsity and speed.
Abstract
An oblivious subspace embedding is a random $m\times n$ matrix $Π$ such that, for any $d$-dimensional subspace, with high probability $Π$ preserves the norms of all vectors in that subspace within a $1\pmε$ factor. In this work, we give an oblivious subspace embedding with the optimal dimension $m=Θ(d/ε^2)$ that has a near-optimal sparsity of $\tilde O(1/ε)$ non-zero entries per column of $Π$. This is the first result to nearly match the conjecture of Nelson and Nguyen [FOCS 2013] in terms of the best sparsity attainable by an optimal oblivious subspace embedding, improving on a prior bound of $\tilde O(1/ε^6)$ non-zeros per column [Chenakkod et al., STOC 2024]. We further extend our approach to the non-oblivious setting, proposing a new family of Leverage Score Sparsified embeddings with Independent Columns, which yield faster runtimes for matrix approximation and regression tasks. In our analysis, we develop a new method which uses a decoupling argument together with the cumulant method for bounding the edge universality error of isotropic random matrices. To achieve near-optimal sparsity, we combine this general-purpose approach with new traces inequalities that leverage the specific structure of our subspace embedding construction.
