Generalized Linear Spectral Statistics of High-dimensional Sample Covariance Matrices and Its Applications
Yanlin Hu, Qing Yang, Xiao Han
TL;DR
This work develops Generalized Linear Spectral Statistics (GLSS) for high-dimensional sample covariance matrices, introducing $\operatorname{tr}f(\bm{S}_n)\bm{B}_n$ to probe spectral properties with flexible rank-$k_n$ projection. It establishes central limit theorems for GLSS in regimes where $k_n$ is comparable to $n$ or vanishes relative to $n$, including explicit closed-form and general-$(\bm\Sigma_n)$ examples, and provides normalization results to aid practical use. Building on GLSS, the authors propose a Functional Projection approach for eigenspace testing in population-spiked covariance models, achieving asymptotic normality under mild spike assumptions and demonstrating universality even when spike magnitudes are small. Extensive simulations and numerical studies validate the theory, compare against existing methods, and demonstrate robustness and practical viability for high-dimensional spectral inference.
Abstract
In this paper, we introduce the \textbf{G}eneralized \textbf{L}inear \textbf{S}pectral \textbf{S}tatistics (GLSS) of a high-dimensional sample covariance matrix $\bm{S}_n$, denoted as $\operatorname{tr}f(\bm{S}_n)\bm{B}_n$, which effectively captures distinct spectral properties of $\bm{S}_n$ by incorporating an ancillary matrix $\bm{B}_n$ and a test function $f$. The joint asymptotic normality of GLSS associated with different test functions is established under mild assumptions on $\bm{B}_n$ and the underlying distribution, when the dimension $n$ and sample size $N$ are comparable. The convergence rate of GLSS is determined by $\sqrt{{N}/{\operatorname{rank}(\bm{B}_n)}}$. Subsequently, we propose a novel functional projection approach based on GLSS for hypothesis testing on eigenspaces of ``population-spiked'' covariance matrices, showcasing a universality phenomenon in the magnitude of the spikes. The theoretical accuracy of our results established for GLSS and the advantages of the newly suggested testing procedure are demonstrated through various numerical studies.
