Harnessing Diffusion-Yielded Score Priors for Image Restoration
Xinqi Lin, Fanghua Yu, Jinfan Hu, Zhiyuan You, Wu Shi, Jimmy S. Ren, Jinjin Gu, Chao Dong
TL;DR
HYPIR introduces a practical image restoration paradigm that leverages pretrained diffusion models as initialization priors and refines them with lightweight adversarial training using LoRA. By bypassing diffusion losses and iterative sampling, the method achieves fast, stable convergence while delivering high-fidelity, realistic restorations and enabling user-controlled prompts, texture richness, and fidelity-generation trade-offs. Theoretical results quantify proximity to the natural image distribution and illustrate benefits such as small initial gradients, broad mode coverage, and accelerated convergence. Empirically, HYPIR outperforms prior state-of-the-art approaches across multiple datasets, scales to large diffusion backbones, and offers flexible, controllable restoration for real-world scenarios.
Abstract
Deep image restoration models aim to learn a mapping from degraded image space to natural image space. However, they face several critical challenges: removing degradation, generating realistic details, and ensuring pixel-level consistency. Over time, three major classes of methods have emerged, including MSE-based, GAN-based, and diffusion-based methods. However, they fail to achieve a good balance between restoration quality, fidelity, and speed. We propose a novel method, HYPIR, to address these challenges. Our solution pipeline is straightforward: it involves initializing the image restoration model with a pre-trained diffusion model and then fine-tuning it with adversarial training. This approach does not rely on diffusion loss, iterative sampling, or additional adapters. We theoretically demonstrate that initializing adversarial training from a pre-trained diffusion model positions the initial restoration model very close to the natural image distribution. Consequently, this initialization improves numerical stability, avoids mode collapse, and substantially accelerates the convergence of adversarial training. Moreover, HYPIR inherits the capabilities of diffusion models with rich user control, enabling text-guided restoration and adjustable texture richness. Requiring only a single forward pass, it achieves faster convergence and inference speed than diffusion-based methods. Extensive experiments show that HYPIR outperforms previous state-of-the-art methods, achieving efficient and high-quality image restoration.
