Estimating a graph's spectrum via random Kirchhoff forests
Simon Barthelmé, Fabienne Castell, Alexandre Gaudillière, Clothilde Melot, Matteo Quattropani, Nicolas Tremblay
TL;DR
The paper tackles scalable estimation of a graph's spectrum without full eigendecomposition. It introduces Kirchhoff forests to estimate non-linear spectral moments $h(q,k)=\mu((q/(q+\lambda))^k)$ across a grid of $q$ and small $k$, and then reconstructs the spectral density using a maximum-entropy approach on the transformed measure $\nu_q$, aggregating to a spectral cdf. The key contributions include a forest-based estimator with complexity $\mathcal{O}((\alpha+n_\lambda) s l n)$ that can be sublinear in $|\mathcal{E}|$ for modest precision, a reconstruction pipeline for the spectral cdf, and empirical results showing practical speedups over baselines in the moderate-accuracy regime. This approach enables approximate spectrum estimation on very large graphs where exact eigen-decomposition is infeasible, with potential impact for graph signal processing and spectral analyses.
Abstract
Exact eigendecomposition of large matrices is very expensive, and it is practically impossible to compute exact eigenvalues. Instead, one may set a more modest goal of approaching the empirical distribution of the eigenvalues, recovering the overall shape of the eigenspectrum. Current approaches to spectral estimation typically work with \emph{moments} of the spectral distribution. These moments are first estimated using Monte Carlo trace estimators, then the estimates are combined to approximate the spectral density. In this article we show how \emph{Kirchhoff forests}, which are random forests on graphs, can be used to estimate certain non-linear moments of very large graph Laplacians. We show how to combine these moments into an estimate of the spectral density. If the estimate's desired precision isn't too high, our approach paves the way to the estimation of a graph's spectrum in time sublinear in the number of links.
