Double-Estimation-Friendly Inference for High-Dimensional Measurement Error Models with Non-Sparse Adaptability
Shijie Cui, Xu Guo, Songshan Yang, Zhe Zhang
TL;DR
This work addresses inference for a single coefficient in high-dimensional measurement-error models under potential misspecification of the outcome or exposure models. It develops a double-robust, decorrelated score test $T_{DF}$ based on a double-robust moment condition, with a variance estimator $\widehat{\sigma}$, and proves asymptotic normality under $H_0$ in both low- and high-dimensional regimes. A sparsity-adaptive procedure extends the method to settings where the sparsity assumption is violated by using Dantzig-selector-like estimators and data-driven tuning, preserving validity via the DEF property. Simulation studies and real-data analysis (CAMP) demonstrate size control and nontrivial power under misspecification, highlighting the approach's robustness and practical relevance for high-dimensional epidemiological and genomic data where measurement error is present.
Abstract
In this paper, we introduce an innovative testing procedure for assessing individual hypotheses in high-dimensional linear regression models with measurement errors. This method remains robust even when either the X-model or Y-model is misspecified. We develop a double robust score function that maintains a zero expectation if one of the models is incorrect, and we construct a corresponding score test. We first show the asymptotic normality of our approach in a low-dimensional setting, and then extend it to the high-dimensional models. Our analysis of high-dimensional settings explores scenarios both with and without the sparsity condition, establishing asymptotic normality and non-trivial power performance under local alternatives. Simulation studies and real data analysis demonstrate the effectiveness of the proposed method.
