Non-equilibrium active noise enhances generative memory in diffusion models
Agnish Kumar Behera, Alexandra Lamtyugina, Aditya Nandy, Daiki Goto, Carlos Floyd, Suriyanarayanan Vaikuntanathan
TL;DR
The paper shows that driving diffusion models with non-equilibrium, active noise fundamentally changes information flow, enabling memory of high-level concepts to be stored in temporal correlations of auxiliary variables. By formulating an active forward process and a corresponding reverse diffusion with scores on the active degrees of freedom, the authors demonstrate slower information decay via Fisher memory curves and earlier, robust speciation in the reverse process. Across toy models, alanine dipeptide, and large-scale datasets like MNIST and CIFAR-10, active diffusion yields sharper, more faithful multi-scale structures and improved fidelity (lower FID) without extra training tricks. These results suggest a thermodynamically distinct and practically advantageous route—active generative AI—for exploring rugged energy landscapes and retaining semantic information during sampling, with potential to simplify learning through physics-informed dynamics.
Abstract
Generative diffusion models have emerged as powerful tools for sampling high-dimensional distributions, yet they typically rely on white gaussian noise and noise schedules to destroy and reconstruct information. Here, we demonstrate that driving the generative process out of equilibrium using active, temporally correlated noise sources fundamentally alters the information thermodynamics of the system. We show that coupling the data to an active non-Markovian bath creates a `memory effect' where high-level semantic information (such as class identity or molecular metastability) is stored in the temporal correlations of auxiliary degrees of freedom. Using Fisher information analysis, we prove that this active mechanism significantly retards the rate of information decay compared to passive Brownian motion. Crucially, this memory effect facilitates an earlier and more robust symmetry breaking (speciation) during the reverse generative process, allowing the system to resolve multi-scale structures, reminiscent of metastable states in molecular configurations that are washed out in the typical noising processes. Our results suggest that non-equilibrium protocols, inspired by active matter physics, offer a thermodynamically distinct and potentially advantageous pathway for recovering high-dimensional energy landscapes using generative diffusion.
