Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training
Tony Bonnaire, Raphaël Urfin, Giulio Biroli, Marc Mézard
TL;DR
The paper tackles why diffusion and score-based generative models avoid memorizing training data while still achieving strong performance. It uncovers two well-separated training timescales, $\tau_{\\mathrm{gen}}$ and $\\tau_{\\mathrm{mem}}$, with $\\tau_{\\mathrm{mem}}$ growing roughly linearly with the dataset size $n$, producing a growing generalization window as $n$ increases. Through extensive experiments on CelebA with U-Nets and a theoretically tractable Random Features Network, the authors show that the memorization phase is an intrinsic dynamical phenomenon rather than a simple consequence of repeated data exposure, and they connect the two timescales to distinct spectral bulks in the RFN analysis. The work highlights implicit dynamical regularization as a core mechanism that promotes generalization in overparameterized diffusion models and provides practical guidance (early stopping, capacity control) to avoid memorization in practice. Overall, it offers a unified framework linking training dynamics, spectral properties, and generalization behavior across score-based generative models.
Abstract
Diffusion models have achieved remarkable success across a wide range of generative tasks. A key challenge is understanding the mechanisms that prevent their memorization of training data and allow generalization. In this work, we investigate the role of the training dynamics in the transition from generalization to memorization. Through extensive experiments and theoretical analysis, we identify two distinct timescales: an early time $τ_\mathrm{gen}$ at which models begin to generate high-quality samples, and a later time $τ_\mathrm{mem}$ beyond which memorization emerges. Crucially, we find that $τ_\mathrm{mem}$ increases linearly with the training set size $n$, while $τ_\mathrm{gen}$ remains constant. This creates a growing window of training times with $n$ where models generalize effectively, despite showing strong memorization if training continues beyond it. It is only when $n$ becomes larger than a model-dependent threshold that overfitting disappears at infinite training times. These findings reveal a form of implicit dynamical regularization in the training dynamics, which allow to avoid memorization even in highly overparameterized settings. Our results are supported by numerical experiments with standard U-Net architectures on realistic and synthetic datasets, and by a theoretical analysis using a tractable random features model studied in the high-dimensional limit.
