Estimating the Spectral Density of Large Implicit Matrices
Ryan P. Adams, Jeffrey Pennington, Matthew J. Johnson, Jamie Smith, Yaniv Ovadia, Brian Patton, James Saunderson
TL;DR
This work addresses the challenge of estimating the spectral density of large implicit matrices by casting spectrum inquiries as generalized traces $tr(F(A))$ and employing unbiased randomized Chebyshev expansions. The method combines two key ideas: unbiased estimators of Chebyshev polynomials built from independent matrix-vector estimates and a stochastic trace estimator (Skilling-Hutchinson), augmented with a von Mises kernel to smooth the spectrum and control Gibbs phenomena. It also develops a variance-reduction mechanism via control variates and provides extensive empirical validation on Kneser graphs and classic random-matrix ensembles (Wigner, Wishart, and Wishart+Wigner), showing accurate density recovery under noisy matvecs. The approach yields a practical framework for diagnosing high-dimensional optimization landscapes and dynamic systems where the Hessian or related matrices are large and only accessible implicitly. This paves the way for spectrum-aware analysis in machine learning and statistical modeling with scalable, unbiased estimators.
Abstract
Many important problems are characterized by the eigenvalues of a large matrix. For example, the difficulty of many optimization problems, such as those arising from the fitting of large models in statistics and machine learning, can be investigated via the spectrum of the Hessian of the empirical loss function. Network data can be understood via the eigenstructure of a graph Laplacian matrix using spectral graph theory. Quantum simulations and other many-body problems are often characterized via the eigenvalues of the solution space, as are various dynamic systems. However, naive eigenvalue estimation is computationally expensive even when the matrix can be represented; in many of these situations the matrix is so large as to only be available implicitly via products with vectors. Even worse, one may only have noisy estimates of such matrix vector products. In this work, we combine several different techniques for randomized estimation and show that it is possible to construct unbiased estimators to answer a broad class of questions about the spectra of such implicit matrices, even in the presence of noise. We validate these methods on large-scale problems in which graph theory and random matrix theory provide ground truth.
