Fast Debiasing of the LASSO Estimator
Shuvayan Banerjee, James Saunderson, Radhendushka Srivastava, Ajit Rajwade
TL;DR
This work tackles bias in the Lasso estimator for high-dimensional sparse regression and the computational bottleneck of debiasing via the approximate inverse $\boldsymbol{M}$. It introduces a re-parameterization to the debiasing weight matrix $\boldsymbol{W} = \boldsymbol{A}\boldsymbol{M}^{\top}$, and derives a simple closed-form, unique solution for $\boldsymbol{W}$ under mild conditions on $\mu$ and the sensing matrix $\boldsymbol{A}$. By focusing on the product $\boldsymbol{A}\boldsymbol{M}^{\top}$ rather than $\boldsymbol{M}$ itself, the approach preserves the asymptotic debiasing guarantees while eliminating iterative optimization. Empirical results show that the closed-form $\boldsymbol{W}_e$ achieves the same inference performance as the iterative $\boldsymbol{W}_o$ but with orders-of-magnitude faster computation, making it attractive for streaming high-dimensional inference. The method is particularly well-suited for ensembles of i.i.d. sub-Gaussian rows with diagonal covariance $\boldsymbol{\Sigma}$, enabling scalable and rapid debiasing in practice.
Abstract
In high-dimensional sparse regression, the \textsc{Lasso} estimator offers excellent theoretical guarantees but is well-known to produce biased estimates. To address this, \cite{Javanmard2014} introduced a method to ``debias" the \textsc{Lasso} estimates for a random sub-Gaussian sensing matrix $\boldsymbol{A}$. Their approach relies on computing an ``approximate inverse" $\boldsymbol{M}$ of the matrix $\boldsymbol{A}^\top \boldsymbol{A}/n$ by solving a convex optimization problem. This matrix $\boldsymbol{M}$ plays a critical role in mitigating bias and allowing for construction of confidence intervals using the debiased \textsc{Lasso} estimates. However the computation of $\boldsymbol{M}$ is expensive in practice as it requires iterative optimization. In the presented work, we re-parameterize the optimization problem to compute a ``debiasing matrix" $\boldsymbol{W} := \boldsymbol{AM}^{\top}$ directly, rather than the approximate inverse $\boldsymbol{M}$. This reformulation retains the theoretical guarantees of the debiased \textsc{Lasso} estimates, as they depend on the \emph{product} $\boldsymbol{AM}^{\top}$ rather than on $\boldsymbol{M}$ alone. Notably, we provide a simple, computationally efficient, closed-form solution for $\boldsymbol{W}$ under similar conditions for the sensing matrix $\boldsymbol{A}$ used in the original debiasing formulation, with an additional condition that the elements of every row of $\boldsymbol{A}$ have uncorrelated entries. Also, the optimization problem based on $\boldsymbol{W}$ guarantees a unique optimal solution, unlike the original formulation based on $\boldsymbol{M}$. We verify our main result with numerical simulations.
