Rank of Sparse Bernoulli Matrices
Han Huang
TL;DR
This paper resolves the rank/invertibility behavior of sparse Bernoulli matrices $A_n$ in the regime $\frac{\log n}{n}\le p\le c_{\beta}$, showing that the probability of a small least singular value is governed by local obstructions (zero rows/columns) rather than global structure. The core method blends a refined vector-decomposition into steep ($\mathcal{T}$), robust ($\mathcal{R}$), and gradual ($\mathcal{V}$) classes with a targeted submatrix reduction $A_{I,J}$ and the Litvak–Tikhomirov invertibility via distance framework, augmented by Rogozin anticoncentration and expansion arguments. A key novelty is the treatment of steep vectors through a phase-transition analysis around the critical window $p \asymp \tfrac{\log n}{n}$, including a carefully chosen threshold $s_0$ and a mechanism to ensure a favorable $(I,J)$ submatrix exists with high probability. The main result expresses the precise small-singular-value probability as a sum of an $t$-term and a local-zero-row/column probability, extending LT20 and yielding a sparse-regime counterpart of the strong singularity conjecture for Bernoulli matrices, with application to rank questions and a refined understanding of how local obstructions dominate invertibility. The techniques have broad relevance for random discrete matrices, combining anti-concentration, expansion, and structured-vector partitions to obtain sharp probabilistic invertibility results in the sparse-to-dense transition.
Abstract
Let $ A_n $ be an $n \times n$ random matrix with i.i.d Bernoulli($p$) entries. For a fixed positive integer $β$, suppose $p$ satisfies $$ \frac{ \log(n) }{ n } \le p \le c_β$$ where $c_β\in ( 0, 1/2 )$ is a $β$-dependentvalue. For $t \ge 0$, $$ \mathbb{P} \left\{ s_{ n - β+ 1}(A) \le t n^{-2β+ \mathfrak{n}(1) }(pn)^{-7} \right\} = t + ( 1 + o_\mathfrak{n}(1) ) \mathbb{P} \bigg\{ \mbox{either $β$ rows or $β$ columns of $A_n$ equal $\vec{0}$} \bigg\}. $$
