DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Lingshun Kong, Jiawei Zhang, Dongqing Zou, Jimmy Ren, Xiaohe Wu, Jiangxin Dong, Jinshan Pan
TL;DR
The paper addresses real-world image deblurring by leveraging pre-trained Stable Diffusion priors without conditioning directly on blurred inputs. It introduces a Latent Kernel Prediction Network (LKPN) that learns pixel-wise kernels in latent space and applies them through Element-wise Adaptive Convolution (EAC) to progressively restore structure, with iterative refinement guided by diffusion outputs. Conditioning is implemented via a ControlNet-style branch that fuses LKPN-derived latent guidance with the blurred input, and LKPN is trained jointly with the diffusion model using latent and pixel-space losses. Extensive experiments on synthetic and real-world datasets demonstrate that DeblurDiff achieves superior perceptual quality and structural fidelity compared to state-of-the-art methods, validating robustness to diverse blur patterns.
Abstract
Diffusion models have achieved significant progress in image generation. The pre-trained Stable Diffusion (SD) models are helpful for image deblurring by providing clear image priors. However, directly using a blurry image or pre-deblurred one as a conditional control for SD will either hinder accurate structure extraction or make the results overly dependent on the deblurring network. In this work, we propose a Latent Kernel Prediction Network (LKPN) to achieve robust real-world image deblurring. Specifically, we co-train the LKPN in latent space with conditional diffusion. The LKPN learns a spatially variant kernel to guide the restoration of sharp images in the latent space. By applying element-wise adaptive convolution (EAC), the learned kernel is utilized to adaptively process the input feature, effectively preserving the structural information of the input. This process thereby more effectively guides the generative process of Stable Diffusion (SD), enhancing both the deblurring efficacy and the quality of detail reconstruction. Moreover, the results at each diffusion step are utilized to iteratively estimate the kernels in LKPN to better restore the sharp latent by EAC. This iterative refinement enhances the accuracy and robustness of the deblurring process. Extensive experimental results demonstrate that the proposed method outperforms state-of-the-art image deblurring methods on both benchmark and real-world images.
