Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion
Enrico Ventura, Beatrice Achilli, Gianluigi Silvestri, Carlo Lucibello, Luca Ambrogioni
TL;DR
The paper investigates the latent geometry of generative diffusion under the manifold hypothesis by analyzing the spectrum of the Jacobian of the score function. Employing random-matrix theory for linear-manifold data, it derives spectral distributions and gap formulas, then validates them with experiments on synthetic linear data and real image models. The authors identify three qualitative diffusion phases—trivial, manifold coverage, and manifold consolidation—and show how intermediate and final spectral gaps reveal subspace structure and manifold dimensionality, offering an explanation for why diffusion models avoid manifold overfitting. This framework connects the local geometry of diffusion trajectories to global data structure, providing a quantitative lens for understanding diffusion dynamics and guiding future analyses of latent manifolds in high-dimensional generative models.
Abstract
In this paper, we investigate the latent geometry of generative diffusion models under the manifold hypothesis. For this purpose, we analyze the spectrum of eigenvalues (and singular values) of the Jacobian of the score function, whose discontinuities (gaps) reveal the presence and dimensionality of distinct sub-manifolds. Using a statistical physics approach, we derive the spectral distributions and formulas for the spectral gaps under several distributional assumptions, and we compare these theoretical predictions with the spectra estimated from trained networks. Our analysis reveals the existence of three distinct qualitative phases during the generative process: a trivial phase; a manifold coverage phase where the diffusion process fits the distribution internal to the manifold; a consolidation phase where the score becomes orthogonal to the manifold and all particles are projected on the support of the data. This `division of labor' between different timescales provides an elegant explanation of why generative diffusion models are not affected by the manifold overfitting phenomenon that plagues likelihood-based models, since the internal distribution and the manifold geometry are produced at different time points during generation.
