Correlation tests and sample spectral coherence matrix in the high-dimensional regime
Philippe Loubaton, Alexis Rosuel, Pascal Vallet
TL;DR
The paper tackles high-dimensional testing of independence among components of a complex Gaussian time series by analyzing the linear spectral statistics of the frequency-smoothed spectral coherence estimator. It develops a rigorous CLT framework in the regime $M=O(N^{\alpha})$, with smoothing span $B$ satisfying $c_N=M/B\to c\in(0,1)$, and provides two frequency-grid statistics with asymptotic normality under $\mathcal{H}_0$. Leveraging Bartlett's factorization, Stieltjes transform tools, and Gaussian concentration, the authors derive representations for $\theta_N(f,\nu)$ and show that $\frac{B\theta_N(f,\nu)}{\sigma_N(f)} \to_d \mathcal{N}(0,1)$ and that the aggregated statistics $\zeta_{N,1}(f)$ and $\zeta_{N,2}(f)$ also converge to Gaussian limits (or Chi-square-type limits for the squares). They further show that replacing the unknown $r_N(\nu)$ with a practical estimator $\hat{r}_N(\nu)$ preserves the CLTs, enabling controlled Type I error and power analyses. Numerical simulations across multiple DGPs, including non-Gaussian innovations and spatial dependence, demonstrate good finite-sample behavior and robustness relative to PGY-type methods. The results provide actionable, spectrally-informed tests for independence in ultra-high-dimensional time series, with clear guidance on regime validity and practical estimation.
Abstract
It is established that the linear spectral statistics (LSS) of the smoothed periodogram estimate of the spectral coherence matrix of a complex Gaussian high-dimensional times series (yn) n$\in$Z with independent components satisfy at each frequency a central limit theorem in the asymptotic regime where the sample size N , the dimension M of the observation, and the smoothing span B both converge towards +$\infty$ in such a way that M = O(N $α$ ) for $α$ < 1 and M B $\rightarrow$ c, c $\in$ (0, 1). It is deduced that two recentered and renormalized versions of the LSS, one based on an average in the frequency domain and the other one based on a sum of squares also in the frequency domain, and both evaluated over a well-chosen frequency grid, also verify a central limit theorem. These two statistics are proposed to test with controlled asymptotic level the hypothesis that the components of y are independent. Numerical simulations assess the performance of the two tests.
