Certifying Euclidean Sections and Finding Planted Sparse Vectors Beyond the $\sqrt{n}$ Dimension Threshold
Venkatesan Guruswami, Jun-Ting Hsieh, Prasad Raghavendra
TL;DR
This work addresses certifying that a random $d$-dimensional subspace of $\mathbb{R}^n$ is well-spread, i.e., has distortion $\Delta(X) \le O(1)$, in a regime where $d$ can exceed the classic $\sqrt{n}$ barrier. It introduces a subexponential-time certification framework that interpolates between the polynomial-time $d\le\tilde{O}(\sqrt{n})$ regime and the information-hard regime, via a novel combination of the $2\to 4$ norm proxy, trimming of large coordinates, and control of elementary symmetric polynomials $P_t$. The same machinery extends to the planted sparse-vector problem, offering a subexponential-time algorithm that recovers a sparse planted vector under a random subspace model by leveraging Sum-of-Squares and a careful rounding strategy. Collectively, the results establish a smooth runtime-dimension trade-off, provide subexponential algorithms in a hard regime, and connect certification techniques to planted recovery through SoS-based rounding.
Abstract
We consider the task of certifying that a random $d$-dimensional subspace $X$ in $\mathbb{R}^n$ is well-spread - every vector $x \in X$ satisfies $c\sqrt{n} \|x\|_2 \leq \|x\|_1 \leq \sqrt{n}\|x\|_2$. In a seminal work, Barak et. al. showed a polynomial-time certification algorithm when $d \leq O(\sqrt{n})$. On the other hand, when $d \gg \sqrt{n}$, the certification task is information-theoretically possible but there is evidence that it is computationally hard [MW21,Cd22], a phenomenon known as the information-computation gap. In this paper, we give subexponential-time certification algorithms in the $d \gg \sqrt{n}$ regime. Our algorithm runs in time $\exp(\widetilde{O}(n^{\varepsilon}))$ when $d \leq \widetilde{O}(n^{(1+\varepsilon)/2})$, establishing a smooth trade-off between runtime and the dimension. Our techniques naturally extend to the related planted problem, where the task is to recover a sparse vector planted in a random subspace. Our algorithm achieves the same runtime and dimension trade-off for this task.
