The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability
Luca Ambrogioni
TL;DR
This work reframes generative diffusion as an equilibrium statistical-mechanics problem, introducing Boltzmann ensembles over noiseless data conditioned on noisy states and deriving a self-consistent equation of state for the generative dynamics. It demonstrates that diffusion-based generation undergoes second-order phase transitions of mean-field universality, driven by spontaneous symmetry breaking and characterized by diverging susceptibility and canonical critical exponents. The authors extend the mean-field picture with a multi-site 'generative bath' to explore beyond-mean-field behavior, connect diffusion to associative memory/Hopfield networks, and analyze finite-sample effects through a random-energy REM lens, including memorization via condensation. Experimental evidence from trained diffusion models supports phase-transition–like behavior, including hierarchical feature emergence and sensitivity around critical times, with implications for sample diversity, memorization, and generation quality.
Abstract
Generative diffusion models have achieved spectacular performance in many areas of machine learning and generative modeling. While the fundamental ideas behind these models come from non-equilibrium physics, variational inference and stochastic calculus, in this paper we show that many aspects of these models can be understood using the tools of equilibrium statistical mechanics. Using this reformulation, we show that generative diffusion models undergo second-order phase transitions corresponding to symmetry breaking phenomena. We show that these phase-transitions are always in a mean-field universality class, as they are the result of a self-consistency condition in the generative dynamics. We argue that the critical instability that arises from the phase transitions lies at the heart of their generative capabilities, which are characterized by a set of mean-field critical exponents. Finally, we show that the dynamic equation of the generative process can be interpreted as a stochastic adiabatic transformation that minimizes the free energy while keeping the system in thermal equilibrium.
