A Simple Combination of Diffusion Models for Better Quality Trade-Offs in Image Denoising
Jonas Dornbusch, Emanuel Pfarr, Florin-Alexandru Vasluianu, Frank Werner, Radu Timofte
TL;DR
The paper tackles Gaussian denoising with diffusion models and introduces LCDD, a simple linear-combination approach that inserts a noisy input at an intermediate diffusion state and fuses outputs from short and long inference schedules using a scalar $\lambda$. By using a single pretrained network to handle multiple noise levels and varying the inference schedule length, LCDD achieves favorable distortion-perception trade-offs without additional training. The method delivers state-of-the-art or competitive results across multiple benchmarks (FFHQ, ImageNet, BSD68, McMaster) in PSNR, FID, and LPIPS, and qualitative results show detailed, natural reconstructions. This work provides a practical, parameter-light strategy for balancing distortion and perceptual quality in diffusion-based image restoration with broad applicability and speed advantages.
Abstract
Diffusion models have garnered considerable interest in computer vision, owing both to their capacity to synthesize photorealistic images and to their proven effectiveness in image reconstruction tasks. However, existing approaches fail to efficiently balance the high visual quality of diffusion models with the low distortion achieved by previous image reconstruction methods. Specifically, for the fundamental task of additive Gaussian noise removal, we first illustrate an intuitive method for leveraging pretrained diffusion models. Further, we introduce our proposed Linear Combination Diffusion Denoiser (LCDD), which unifies two complementary inference procedures - one that leverages the model's generative potential and another that ensures faithful signal recovery. By exploiting the inherent structure of the denoising samples, LCDD achieves state-of-the-art performance and offers controlled, well-behaved trade-offs through a simple scalar hyperparameter adjustment.
