Blind denoising diffusion models and the blessings of dimensionality
Zahra Kadkhodaie, Aram-Alexandre Pooladian, Sinho Chewi, Eero Simoncelli
TL;DR
This paper analyzes blind denoising diffusion models (BDDMs) and shows that when data have low intrinsic dimensionality, the blind denoiser can implicitly recover the noise schedule and sample from the data distribution in polynomial time relative to the intrinsic dimension k. It provides a rigorous Bayesian interpretation, derives an implicit schedule σ_t^2 = σ_0^2 e^{−2t} + 2∫_0^t a_s e^{−2(t−s)} ds, and demonstrates that careful discretization (e.g., exponential Euler with a_t = 𝒶 σ_t^2) yields stability and favorable error bounds that depend on k rather than ambient dimension d. Empirically, BDMMs accurately estimate noise variance from a single noisy image and produce higher-quality samples than non-blind baselines, with demonstrations on synthetic Gaussian mixtures and real image datasets (CelebA and LSUN Bedroom). The findings suggest that BDDMs can simplify training and sampling by removing the need for explicit noise-conditioning and can benefit applications requiring robust, perceptually faithful generation and inverse-problem solving.
Abstract
We analyze, theoretically and empirically, the performance of generative diffusion models based on \emph{blind denoisers}, in which the denoiser is not given the noise amplitude in either the training or sampling processes. Assuming that the data distribution has low intrinsic dimensionality, we prove that blind denoising diffusion models (BDDMs), despite not having access to the noise amplitude, \emph{automatically} track a particular \emph{implicit} noise schedule along the reverse process. Our analysis shows that BDDMs can accurately sample from the data distribution in polynomially many steps as a function of the intrinsic dimension. Empirical results corroborate these mathematical findings on both synthetic and image data, demonstrating that the noise variance is accurately estimated from the noisy image. Remarkably, we observe that schedule-free BDDMs produce samples of higher quality compared to their non-blind counterparts. We provide evidence that this performance gain arises because BDDMs correct the mismatch between the true residual noise (of the image) and the noise assumed by the schedule used in non-blind diffusion models.
