Spectral Properties of Elementwise-Transformed Spiked Matrices
Michael J. Feldman
TL;DR
The paper analyzes the spectral properties of elementwise-transformed spiked matrices, proving a PCA phase transition in high dimensions when recovering a low-rank signal from $Y_n = n^{-1/2} f(\sqrt{n} X_n + Z_n)$. By expanding $f$ in an orthogonal polynomial basis with respect to the noise law $\mu$, it shows that the transformed data behave like a standard spiked model with an effective signal strength $\tau(f,\mu)$, yielding Marchenko–Pastur limiting spectra and explicit outlier behavior for the singular values and vectors; a zero-threshold case is handled via a higher-order $\ell_*$-theory. The work applies to nonlinear, discontinuous transforms and includes concrete applications to binomial data, ReLU activation, and data truncation, deriving practical thresholds and optimal preprocessing/shrinkage rules. Overall, it extends spiked matrix theory to nonlinear elementwise transforms, providing rigorous guidance for PCA in non-Gaussian, discrete, or truncated high-dimensional settings and informing data preprocessing strategies to improve signal recovery.
Abstract
This work concerns elementwise-transformations of spiked matrices: $Y_n = n^{-1/2} f( \sqrt{n} X_n + Z_n)$. Here, $f$ is a function applied elementwise, $X_n$ is a low-rank signal matrix, and $Z_n$ is white noise. We find that principal component analysis is powerful for recovering signal under highly nonlinear or discontinuous transformations. Specifically, in the high-dimensional setting where $Y_n$ is of size $n \times p$ with $n,p \rightarrow \infty$ and $p/n \rightarrow γ> 0$, we uncover a phase transition: for signal-to-noise ratios above a sharp threshold -- depending on $f$, the distribution of elements of $Z_n$, and the limiting aspect ratio $γ$ -- the principal components of $Y_n$ (partially) recover those of $X_n$. Below this threshold, the principal components of $Y_n$ are asymptotically orthogonal to the signal. In contrast, in the standard setting where $X_n + n^{-1/2}Z_n$ is observed directly, the analogous phase transition depends only on $γ$. A similar phenomenon occurs with $X_n$ square and symmetric and $Z_n$ a generalized Wigner matrix.
