Soft Diffusion: Score Matching for General Corruptions
Giannis Daras, Mauricio Delbracio, Hossein Talebi, Alexandros G. Dimakis, Peyman Milanfar
TL;DR
The paper expands diffusion-based generative modeling beyond additive noise by introducing Soft Score Matching to learn score functions for general linear corruptions such as Gaussian blur and masking. It unifies training objective, sampling, and scheduling to invert degradation processes, achieving state-of-the-art FID on CelebA-64 (1.85) and offering faster sampling than vanilla diffusion. The approach demonstrates that carefully designed corruption operators and principled optimization can significantly improve sample quality and efficiency. It also introduces Momentum Sampler and a probability-flow variant, broadening the practical applicability of diffusion models to a wider class of inverse problems.
Abstract
We define a broader family of corruption processes that generalizes previously known diffusion models. To reverse these general diffusions, we propose a new objective called Soft Score Matching that provably learns the score function for any linear corruption process and yields state of the art results for CelebA. Soft Score Matching incorporates the degradation process in the network. Our new loss trains the model to predict a clean image, \textit{that after corruption}, matches the diffused observation. We show that our objective learns the gradient of the likelihood under suitable regularity conditions for a family of corruption processes. We further develop a principled way to select the corruption levels for general diffusion processes and a novel sampling method that we call Momentum Sampler. We show experimentally that our framework works for general linear corruption processes, such as Gaussian blur and masking. We achieve state-of-the-art FID score $1.85$ on CelebA-64, outperforming all previous linear diffusion models. We also show significant computational benefits compared to vanilla denoising diffusion.
