DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion
Chunyang Bi, Xin Luo, Sheng Shen, Mengxi Zhang, Huanjing Yue, Jingyu Yang
TL;DR
DeeDSR addresses real-world image SR by integrating degradation-aware priors into a diffusion framework. It first learns degradation semantics through contrastive learning and then guides a frozen Stable Diffusion model with a Degradation-Aware Adapter that fuses global, image-driven degradation prompts with LR content via cross-attention and modulation layers, using Noise Guidance to balance realism and fidelity. The approach achieves state-of-the-art performance on synthetic and real-world benchmarks, with strong semantic preservation and competitive perceptual quality, validated by quantitative metrics and user studies. This work demonstrates that image-driven degradation representations can substantially enhance the semantic fidelity of diffusion-based SR, offering a practical path to robust real-world SR applications.
Abstract
Diffusion models, known for their powerful generative capabilities, play a crucial role in addressing real-world super-resolution challenges. However, these models often focus on improving local textures while neglecting the impacts of global degradation, which can significantly reduce semantic fidelity and lead to inaccurate reconstructions and suboptimal super-resolution performance. To address this issue, we introduce a novel two-stage, degradation-aware framework that enhances the diffusion model's ability to recognize content and degradation in low-resolution images. In the first stage, we employ unsupervised contrastive learning to obtain representations of image degradations. In the second stage, we integrate a degradation-aware module into a simplified ControlNet, enabling flexible adaptation to various degradations based on the learned representations. Furthermore, we decompose the degradation-aware features into global semantics and local details branches, which are then injected into the diffusion denoising module to modulate the target generation. Our method effectively recovers semantically precise and photorealistic details, particularly under significant degradation conditions, demonstrating state-of-the-art performance across various benchmarks. Codes will be released at https://github.com/bichunyang419/DeeDSR.
