Asymptotic distributions of four linear hypotheses test statistics under generalized spiked model
Zhijun Liu, Jiang Hu, Zhidong Bai, Zhihui Lv
TL;DR
The paper advances high-dimensional hypothesis testing by establishing a central limit theorem for linear spectral statistics under a generalized spiked covariance model, allowing spiked eigenvalues to be either bounded or diverging. It then derives asymptotic distributions for four classical linear-test statistics—Wilks' $U$, Lawley-Hotelling $W$, Bartlett-Nanda-Pillai $V$, and Roy's largest root $R$ (the latter used for comparison)—using Taylor expansions to handle complex test functions. The results are complemented by explicit asymptotic means and variances that incorporate spike structure and bulk behavior, as well as asymptotic power functions illustrating when each test is most powerful relative to Roy's root. Numerical simulations validate the theoretical approximations under several covariance models and distributions, highlighting the practical relevance for high-dimensional inference in applications with possibly unbounded population spectra.
Abstract
In this paper, we establish the Central Limit Theorem (CLT) for linear spectral statistics (LSSs) of large-dimensional generalized spiked sample covariance matrices, where the spiked eigenvalues may be either bounded or diverge to infinity. Building upon this theorem, we derive the asymptotic distributions of linear hypothesis test statistics under the generalized spiked model, including Wilks' likelihood ratio test statistic U, the Lawley-Hotelling trace test statistic W, and the Bartlett-Nanda-Pillai trace test statistic V. Due to the complexity of the test functions, explicit solutions for the contour integrals in our calculations are generally intractable. To address this, we employ Taylor series expansions to approximate the theoretical results in the asymptotic regime. We also derive asymptotic power functions for three test criteria above, and make comparisons with Roy's largest root test under specific scenarios. Finally, numerical simulations are conducted to validate the accuracy of our asymptotic approximations.
