Valid Heteroskedasticity Robust Testing
Benedikt M. Pötscher, David Preinerstorfer
TL;DR
This paper tackles the persistent problem that heteroskedasticity-robust tests in linear regression often overreject when conventional asymptotic critical values are used. It proves the existence of smallest size-controlling critical values $C(\alpha)$ for both unrestricted- and restricted-residual test statistics, provides verifiable conditions for their existence, and offers algorithms (with an R package $\texttt{hrt}$) to compute them. The authors show that using these critical values yields valid level-$\alpha$ tests and can improve power relative to standard approaches, while also identifying situations where certain tests become trivial under size control. The framework extends beyond Gaussian errors to elliptical and semiparametric settings and accommodates stochastic regressors, broadening practical applicability. Overall, the work delivers a principled, computable approach to robust hypothesis testing under unknown forms of heteroskedasticity with clear guidance for practitioners.
Abstract
Tests based on heteroskedasticity robust standard errors are an important technique in econometric practice. Choosing the right critical value, however, is not simple at all: conventional critical values based on asymptotics often lead to severe size distortions; and so do existing adjustments including the bootstrap. To avoid these issues, we suggest to use smallest size-controlling critical values, the generic existence of which we prove in this article for the commonly used test statistics. Furthermore, sufficient and often also necessary conditions for their existence are given that are easy to check. Granted their existence, these critical values are the canonical choice: larger critical values result in unnecessary power loss, whereas smaller critical values lead to over-rejections under the null hypothesis, make spurious discoveries more likely, and thus are invalid. We suggest algorithms to numerically determine the proposed critical values and provide implementations in accompanying software. Finally, we numerically study the behavior of the proposed testing procedures, including their power properties.
