On learning higher-order cumulants in diffusion models
Gert Aarts, Diaa E. Habibi, Lingxiao Wang, Kai Zhou
TL;DR
This work analyzes how higher-order cumulants evolve in diffusion models, addressing whether non-Gaussian correlations are preserved or learned. By deriving explicit moment- and cumulant-generating functionals for both forward and backward processes, it shows that in driftless diffusion higher cumulants $\kappa_{n>2}$ are conserved, while in drifted schemes they decay toward Gaussianity; nonetheless the backward score regenerates the target higher-order structure. The authors verify these predictions in an exactly solvable toy model and in a lattice $\phi^4$ theory, demonstrating accurate learning of non-Gaussian cumulants across many degrees of freedom. The results suggest diffusion models can faithfully capture complex correlations in physics-inspired data, with implications for generating lattice configurations and for refining diffusion-based methods through informed noise scheduling and RG-inspired perspectives.
Abstract
To analyse how diffusion models learn correlations beyond Gaussian ones, we study the behaviour of higher-order cumulants, or connected n-point functions, under both the forward and backward process. We derive explicit expressions for the moment- and cumulant-generating functionals, in terms of the distribution of the initial data and properties of forward process. It is shown analytically that during the forward process higher-order cumulants are conserved in models without a drift, such as the variance-expanding scheme, and that therefore the endpoint of the forward process maintains nontrivial correlations. We demonstrate that since these correlations are encoded in the score function, higher-order cumulants are learnt in the backward process, also when starting from a normal prior. We confirm our analytical results in an exactly solvable toy model with nonzero cumulants and in scalar lattice field theory.
