Fundamental Limits of Spectral Clustering in Stochastic Block Models
Anderson Ye Zhang
TL;DR
This work establishes the fundamental limits of spectral clustering for community detection in Stochastic Block Models under extreme sparsity, showing that the clustering error decays exponentially with an exact exponent J_min. The authors develop a novel truncated ℓ2 perturbation analysis of eigenvectors, paired with an eigenvector truncation mechanism and leave-one-out decoupling, to obtain matching upper and lower bounds with identical leading constants (up to small log factors). The main results quantify the precise exponential rate in terms of the model parameters, including the community sizes {n_a}, within-same-community probability p, and cross-community probability q, via J_min. These findings sharply characterize when spectral clustering achieves exact recovery and illuminate the technique's limitations in very sparse regimes, providing a rigorous, instance-wise understanding beyond minimax perspectives.
Abstract
Spectral clustering has been widely used for community detection in network sciences. While its empirical successes are well-documented, a clear theoretical understanding, particularly for sparse networks where degrees are much smaller than $\log n$, remains unclear. In this paper, we address this significant gap by demonstrating that spectral clustering offers exponentially small error rates when applied to sparse networks under Stochastic Block Models. Our analysis provides sharp characterizations of its performance, backed by matching upper and lower bounds possessing an identical exponent with the same leading constant. The key to our results is a novel truncated $\ell_2$ perturbation analysis for eigenvectors, coupled with a new analysis idea of eigenvectors truncation.
