Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation
Michele De Vita, Vasileios Belagiannis
TL;DR
We address the lack of quantitative uncertainty in diffusion-model image generation by introducing a training-free, pixel-wise aleatoric uncertainty estimate computed as the variance of denoising scores under a diffusion-specific perturbation during sampling. This uncertainty is shown to be linked to the second derivative (curvature) of the noising distribution, enabling a principled guidance of the sampling process to emphasize high-uncertainty regions and improve sample quality. Through extensive experiments on ImageNet and CIFAR-10, the method both filters out low-quality samples and achieves better FID scores compared to baselines like MC-Dropout and BayesDiff, with substantially fewer function evaluations. The approach is scheduler-agnostic and compatible with unconditional, class-conditional, and text-to-image diffusion models, offering a practical, training-free tool for improving diffusion-based generation and reliability.
Abstract
Despite the remarkable progress in generative modelling, current diffusion models lack a quantitative approach to assess image quality. To address this limitation, we propose to estimate the pixel-wise aleatoric uncertainty during the sampling phase of diffusion models and utilise the uncertainty to improve the sample generation quality. The uncertainty is computed as the variance of the denoising scores with a perturbation scheme that is specifically designed for diffusion models. We then show that the aleatoric uncertainty estimates are related to the second-order derivative of the diffusion noise distribution. We evaluate our uncertainty estimation algorithm and the uncertainty-guided sampling on the ImageNet and CIFAR-10 datasets. In our comparisons with the related work, we demonstrate promising results in filtering out low quality samples. Furthermore, we show that our guided approach leads to better sample generation in terms of FID scores.
