Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity
Eliot Beyler, Francis Bach
TL;DR
The paper investigates one-step denoising in score-based generative models, contrasting full-denoising with half-denoising and examining how data regularity vs. manifold structure affects performance. It develops rigorous bounds showing half-denoising achieves $O(\sigma^4)$ accuracy in both MMD and $W_2$ distances under regular densities, while full-denoising yields $O(\sigma^2)$ accuracy and can outperform in singular settings such as Dirac mixtures or low-dimensional supports. The analysis extends to subspace scenarios and mixtures, revealing a trade-off between correcting to the subspace and reducing distortion on the subspace itself; in particular, full-denoising can alleviate the curse of dimensionality when the data lie on low-dimensional linear structures. The results offer practical guidance for single-step and multi-step diffusion-model design (e.g., DDIM and related samplers), suggesting that denoiser choice should be adapted to the target density’s regularity and subspace geometry, with potential extensions to linear-manifold and more general manifold settings.
Abstract
Score-based generative models achieve state-of-the-art sampling performance by denoising a distribution perturbed by Gaussian noise. In this paper, we focus on a single deterministic denoising step, and compare the optimal denoiser for the quadratic loss, we name ''full-denoising'', to the alternative ''half-denoising'' introduced by Hyv{ä}rinen (2024). We show that looking at the performances in term of distance between distribution tells a more nuanced story, with different assumptions on the data leading to very different conclusions. We prove that half-denoising is better than full-denoising for regular enough densities, while full-denoising is better for singular densities such as mixtures of Dirac measures or densities supported on a low-dimensional subspace. In the latter case, we prove that full-denoising can alleviate the curse of dimensionality under a linear manifold hypothesis.
