Valid F-screening in linear regression
Olivia McGough, Daniela Witten, Daniel Kessler
TL;DR
The paper tackles invalid inference after F-screening in linear regression by introducing a conditional selective inference framework that accounts for the initial omnibus test $H_0^{1:p}: \beta_1=\cdots=\beta_p=0$ being rejected. It develops selective p-values $p_{H_0^M\mid E}$ that control the selective Type I error and can be computed from standard regression outputs, including a debiased variance estimator $\tilde{\sigma}^2$ for unknown variance. The authors quantify leftover Fisher information after selection, compare to sample splitting, and demonstrate higher information and power in the selective approach, with extensive simulations and real-data reanalyses (prospective and retrospective). They also provide a practical retrospective analysis pathway using only summary statistics and discuss specialized cases and geometry, concluding with limitations and future extensions. Overall, the framework enables valid, end-to-end selective inference for regression coefficients in the common F-screening scenario, including retrospective corrections for published findings.
Abstract
Suppose that a data analyst wishes to report the results of a least squares linear regression only if the overall null hypothesis, $H_0^{1:p}: β_1= β_2 = \ldots = β_p=0$, is rejected. This practice, which we refer to as F-screening (since the overall null hypothesis is typically tested using an $F$-statistic), is in fact common practice across a number of applied fields. Unfortunately, it poses a problem: standard guarantees for the inferential outputs of linear regression, such as Type 1 error control of hypothesis tests and nominal coverage of confidence intervals, hold unconditionally, but fail to hold conditional on rejection of the overall null hypothesis. In this paper, we develop an inferential toolbox for the coefficients in a least squares model that are valid conditional on rejection of the overall null hypothesis. We develop selective p-values that lead to tests that are consistent and control the selective Type 1 error, i.e., the Type 1 error conditional on having rejected the overall null hypothesis. Furthermore, they can be computed without access to the raw data, i.e., using only the standard outputs of a least squares linear regression, and therefore are suitable for use in a retrospective analysis of a published study. We also develop confidence intervals that attain nominal selective coverage, and point estimates that account for having rejected the overall null hypothesis. We derive an expression for the Fisher information about the coefficients resulting from the proposed approach, and compare this to the Fisher information that results from an alternative approach that relies on sample splitting. We investigate the proposed approach in simulation and via re-analysis of two datasets from the biomedical literature.
