Universality of estimators for high-dimensional linear models with block dependency
Toshiki Tsuda, Masaaki Imaizumi
TL;DR
This paper tackles the challenge of universality for estimators in high-dimensional linear models when covariates exhibit block dependence rather than Gaussianity. It introduces a generalized Lindeberg principle to handle block structure and develops an explicit error bound that links the universality gap to the block size $d$ and smoothness properties of the loss and regularizer. The main results show that, under suitable growth conditions and moment assumptions, the distribution of estimators under block-dependent covariates matches that under Gaussian covariates with the same moments, including for robust estimators like absolute loss and Huber loss as well as standard regularized methods such as Lasso and Ridge. The work also provides precise application results and numerical experiments demonstrating universality even when block sizes are as large as the parameter dimension $p$, highlighting the practical impact for high-dimensional inference under realistic dependency structures.
Abstract
We study the universality property of estimators for high-dimensional linear models, which implies that the distribution of estimators is independent of whether the covariates follow a Gaussian distribution. Recent developments in high-dimensional statistics typically require covariates to strictly follow a Gaussian distribution to precisely characterize the properties of estimators. To relax this Gaussianity requirement, the existing literature has examined conditions under which estimators achieve universality. In particular, independence among the elements of the high-dimensional covariates has played a critical role. In this study, we focus on high-dimensional linear models with covariates exhibiting block dependence, where covariate elements can only be dependent within each block, and show that estimators for such models retain universality. Specifically, we prove that the distribution of estimators with Gaussian covariates can be approximated by the distribution of estimators with non-Gaussian covariates having the same moments under block dependence. To establish this result, we develop a generalized Lindeberg principle suitable for handling block dependencies and derive new error bounds for correlated covariate elements. We further demonstrate the universality result across several different estimators.
