Optimal Subspace Embeddings: Resolving Nelson-Nguyen Conjecture Up to Sub-Polylogarithmic Factors
Shabarish Chenakkod, Michał Dereziński, Xiaoyu Dong
TL;DR
This work resolves the Nelson–Nguyen conjecture on oblivious subspace embeddings up to sub-polylogarithmic factors by constructing OSNAP-based embeddings with dimension m = 4~O(d/b5^2) and sparsity per column 4~O(b5^{-1} ) that preserve all d-dimensional subspaces with high probability. The authors introduce iterative decoupling, a novel matrix-concentration technique that recursively tightens higher-order trace moments by decoupling powers into products of independent copies, refining Gaussian-universality-based bounds. They develop a sequence of moment estimates culminating in embedding guarantees for OSNAP matrices, and demonstrate implications for fast linear-regression sketches with time 4~O( ext{nnz}(A)ce 8/d). The results unify and surpass prior polylogarithmic gaps, offering a principled, near-optimal pathway to dimension-sparsity trade-offs in subspace embedding, with direct impact on sketching-based linear algebra. Practically, this yields faster sketch-based reductions for large-scale regression and matrix computations, underpinned by a robust probabilistic framework for higher-order moment control.
Abstract
We give a proof of the conjecture of Nelson and Nguyen [FOCS 2013] on the optimal dimension and sparsity of oblivious subspace embeddings, up to sub-polylogarithmic factors: For any $n\geq d$ and $ε\geq d^{-O(1)}$, there is a random $\tilde O(d/ε^2)\times n$ matrix $Π$ with $\tilde O(\log(d)/ε)$ non-zeros per column such that for any $A\in\mathbb{R}^{n\times d}$, with high probability, $(1-ε)\|Ax\|\leq\|ΠAx\|\leq(1+ε)\|Ax\|$ for all $x\in\mathbb{R}^d$, where $\tilde O(\cdot)$ hides only sub-polylogarithmic factors in $d$. Our result in particular implies a new fastest sub-current matrix multiplication time reduction of size $\tilde O(d/ε^2)$ for a broad class of $n\times d$ linear regression tasks. A key novelty in our analysis is a matrix concentration technique we call iterative decoupling, which we use to fine-tune the higher-order trace moment bounds attainable via existing random matrix universality tools [Brailovskaya and van Handel, GAFA 2024].
