Reflected Diffusion Models
Aaron Lou, Stefano Ermon
TL;DR
Reflected Diffusion Models address the core limitation of standard diffusion models—sampling drift and artifacts caused by thresholding—by formulating diffusion on bounded domains Ω using reflected SDEs. The authors develop constrained denoising score matching (CDSM) to learn the perturbed density scores on Ω, and show that fundamental diffusion tools such as diffusion guidance, likelihood bounds, and probability-flow ODE sampling carry over to the reflected setting. They demonstrate competitive image benchmarks (e.g., CIFAR-10) and scalable diffusion on high-dimensional simplices, while preserving principled data-domain constraints without architectural changes. The work clarifies the relationship between thresholding and reflected dynamics, enabling stable, fast, and faithful sampling under strong guidance and providing a pathway for principled likelihood estimation in constrained generative modeling.
Abstract
Score-based diffusion models learn to reverse a stochastic differential equation that maps data to noise. However, for complex tasks, numerical error can compound and result in highly unnatural samples. Previous work mitigates this drift with thresholding, which projects to the natural data domain (such as pixel space for images) after each diffusion step, but this leads to a mismatch between the training and generative processes. To incorporate data constraints in a principled manner, we present Reflected Diffusion Models, which instead reverse a reflected stochastic differential equation evolving on the support of the data. Our approach learns the perturbed score function through a generalized score matching loss and extends key components of standard diffusion models including diffusion guidance, likelihood-based training, and ODE sampling. We also bridge the theoretical gap with thresholding: such schemes are just discretizations of reflected SDEs. On standard image benchmarks, our method is competitive with or surpasses the state of the art without architectural modifications and, for classifier-free guidance, our approach enables fast exact sampling with ODEs and produces more faithful samples under high guidance weight.
