Cross-fluctuation phase transitions reveal sampling dynamics in diffusion models
Sai Niranjan Ramachandran, Manish Krishan Lal, Suvrit Sra
TL;DR
This work introduces cross-fluctuation mergers, a centred-moment statistic $\mathcal{M}^{(n)}_{\rho}$, to detect discrete phase-like transitions in the sampling dynamics of score-based diffusion models. By tracking forward diffusion and the conditional fluctuation tensors of two user-defined events, the authors reveal sharp mergers at specific steps, enabling early stopping, class-wise interval guidance, and improvements in zero-shot classification and style transfer without retraining. They derive a closed form for cross-fluctuations in variance-preserving SDEs, connect the approach to centered kernel alignment and classical coupling, and demonstrate practical gains across acceleration, conditional generation, rare-class synthesis, zero-shot classification, and zero-shot style transfer. The framework unifies discrete Markov-chain concepts with continuous stochastic dynamics, offering a scalable, theory-grounded toolkit for diagnosing and improving diffusion-model sampling. This bridges statistical-physics intuition with modern generative modeling, enabling efficient, targeted interventions that improve performance across multiple downstream tasks.
Abstract
We analyse how the sampling dynamics of distributions evolve in score-based diffusion models using cross-fluctuations, a centered-moment statistic from statistical physics. Specifically, we show that starting from an unbiased isotropic normal distribution, samples undergo sharp, discrete transitions, eventually forming distinct events of a desired distribution while progressively revealing finer structure. As this process is reversible, these transitions also occur in reverse, where intermediate states progressively merge, tracing a path back to the initial distribution. We demonstrate that these transitions can be detected as discontinuities in $n^{\text{th}}$-order cross-fluctuations. For variance-preserving SDEs, we derive a closed-form for these cross-fluctuations that is efficiently computable for the reverse trajectory. We find that detecting these transitions directly boosts sampling efficiency, accelerates class-conditional and rare-class generation, and improves two zero-shot tasks--image classification and style transfer--without expensive grid search or retraining. We also show that this viewpoint unifies classical coupling and mixing from finite Markov chains with continuous dynamics while extending to stochastic SDEs and non Markovian samplers. Our framework therefore bridges discrete Markov chain theory, phase analysis, and modern generative modeling.
