What's the score? Automated Denoising Score Matching for Nonlinear Diffusions
Raghav Singhal, Mark Goldstein, Rajesh Ranganath
TL;DR
This paper tackles the challenge of training score-based diffusion models when the forward transition is nonlinear and intractable by introducing Local Denoising Score Matching (local-DSM). Local-DSM builds the training objective from local increments of the transition kernel $q(\mathbf{y}_t|\mathbf{y}_s)$ and uses local linearization to yield Gaussian transitions, enabling automated score estimation without explicit solutions for nonlinear $q(\mathbf{y}_t|\mathbf{y}_0)$. It derives mean and covariance equations via matrix exponentials, proposes error-control schedules for the Taylor approximation, and provides algorithms plus extensions (perceptual weighting, score modeling) with theoretical error bounds. Empirically, Local-DSM achieves faster training and improved sample quality on low-dimensional and CIFAR-10 tasks with non-Gaussian priors, and it enables score estimation for nonlinear processes in physics and related sciences, broadening the practical reach of diffusion-based modeling.
Abstract
Reversing a diffusion process by learning its score forms the heart of diffusion-based generative modeling and for estimating properties of scientific systems. The diffusion processes that are tractable center on linear processes with a Gaussian stationary distribution. This limits the kinds of models that can be built to those that target a Gaussian prior or more generally limits the kinds of problems that can be generically solved to those that have conditionally linear score functions. In this work, we introduce a family of tractable denoising score matching objectives, called local-DSM, built using local increments of the diffusion process. We show how local-DSM melded with Taylor expansions enables automated training and score estimation with nonlinear diffusion processes. To demonstrate these ideas, we use automated-DSM to train generative models using non-Gaussian priors on challenging low dimensional distributions and the CIFAR10 image dataset. Additionally, we use the automated-DSM to learn the scores for nonlinear processes studied in statistical physics.
