Non-Normal Diffusion Models
Henry Li
TL;DR
The paper generalizes diffusion models by relaxing the conventional Gaussian assumption on diffusion increments $Δ\mathbf{x}_k$, proving an invariance principle that ensures convergence to a diffusion process as time steps shrink. This leads to a flexible framework where forward and backward updates can use arbitrary increment distributions (e.g., Gaussian, Laplace, Uniform) without sacrificing tractable training, via corresponding KL-based loss terms. It derives explicit loss forms for several combinations of increment and model distributions, showing both competitive likelihoods and qualitative differences in generated samples. The work broadens the methodological toolkit for score-based diffusion, enabling richer stylistic control and potential improvements in density estimation and image generation. It also highlights avenues for future theoretical analysis of score matching under alternative norms and the broader implications for diffusion-model design.
Abstract
Diffusion models generate samples by incrementally reversing a process that turns data into noise. We show that when the step size goes to zero, the reversed process is invariant to the distribution of these increments. This reveals a previously unconsidered parameter in the design of diffusion models: the distribution of the diffusion step $Δx_k := x_{k} - x_{k + 1}$. This parameter is implicitly set by default to be normally distributed in most diffusion models. By lifting this assumption, we generalize the framework for designing diffusion models and establish an expanded class of diffusion processes with greater flexibility in the choice of loss function used during training. We demonstrate the effectiveness of these models on density estimation and generative modeling tasks on standard image datasets, and show that different choices of the distribution of $Δx_k$ result in qualitatively different generated samples.
