Taming Diffusion Models for Image Restoration: A Review
Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön
TL;DR
This review surveys diffusion-model frameworks for image restoration, outlining how forward diffusion, score-based SDEs, and conditional diffusion enable robust HQ recovery from degraded inputs. It categorizes IR approaches into conditional direct diffusion, training-free conditioning, and diffusion processes toward degraded images (IR-SDE and diffusion bridges), and discusses their trade-offs in fidelity, consistency, and efficiency. Key contributions include connecting DDPMs to VP-SDEs, detailing conditional guidance with score-based conditioning, and outlining practical restoration pipelines (e.g., SR3, Palette, StableSR) along with data-consistency strategies like DPS and diffusion-bridge methods. The review highlights challenges such as OOD degradations, texture consistency, and computational cost, and points to future directions in more efficient sampling, flow-based or optimal-transport formulations, and language-guided IR to improve robustness and realism in real-world conditions.
Abstract
Diffusion models have achieved remarkable progress in generative modelling, particularly in enhancing image quality to conform to human preferences. Recently, these models have also been applied to low-level computer vision for photo-realistic image restoration (IR) in tasks such as image denoising, deblurring, dehazing, etc. In this review paper, we introduce key constructions in diffusion models and survey contemporary techniques that make use of diffusion models in solving general IR tasks. Furthermore, we point out the main challenges and limitations of existing diffusion-based IR frameworks and provide potential directions for future work.
