HDW-SR: High-Frequency Guided Diffusion Model based on Wavelet Decomposition for Image Super-Resolution

Chao Yang; Boqian Zhang; Jinghao Xu; Guang Jiang

TL;DR

HDW-SR is a high-frequency guided diffusion framework for image super-resolution that diffuses only the residual map and uses wavelet-based downsampling to preserve and exploit high-frequency detail. The method combines HDW-Net (HE-Net and HA-Net), which is built on lossless wavelet sampling, with a DFA-based encoder and a Dynamic Thresholding Block (DTB) to provide sparse, adaptive high-frequency guidance during diffusion. On synthetic and real-world datasets, it achieves competitive quantitative performance and notably better detail fidelity, outperforming several diffusion-based and GAN-based baselines on no-reference quality metrics. The approach also supports multi-level wavelet decompositions, offering a practical path to sharper textures and edges in SR.

Abstract

Diffusion-based methods have shown great promise in single image super-resolution (SISR); however, existing approaches often produce blurred fine details due to insufficient guidance in the high-frequency domain. To address this issue, we propose a High-Frequency Guided Diffusion Network based on Wavelet Decomposition (HDW-SR), which replaces the conventional U-Net backbone in diffusion frameworks. Specifically, we perform diffusion only on the residual map, allowing the network to focus more effectively on high-frequency information restoration. We then introduce wavelet-based downsampling in place of standard CNN downsampling to achieve multi-scale frequency decomposition, enabling sparse cross-attention between the high-frequency subbands of the pre-super-resolved image and the low-frequency subbands of the diffused image for explicit high-frequency guidance. Moreover, a Dynamic Thresholding Block (DTB) is designed to refine high-frequency selection during the sparse attention process. During upsampling, the invertibility of the wavelet transform ensures low-loss feature reconstruction. Experiments on both synthetic and real-world datasets demonstrate that HDW-SR achieves competitive super-resolution performance, excelling particularly in recovering fine-grained image details. The code will be available after acceptance.
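The invertibility property that the abstract relies on can be illustrated with a minimal NumPy sketch. It assumes a single-level orthonormal Haar wavelet and a single-channel image with even height and width. The abstract does not state which wavelet family HDW-SR uses, so the Haar choice, the function names `haar_dwt2` and `haar_idwt2`, and the subband labels are all illustrative assumptions, not the authors' implementation. The sketch also forms a residual map as the difference between a stand-in high-resolution image and a stand-in pre-super-resolved image, which is one common way to define the quantity being diffused.

```python
import numpy as np


def haar_dwt2(x):
    """One-level 2D Haar transform of an (H, W) array with even H and W.

    Returns four half-resolution subbands: one low-frequency average (ll)
    and three high-frequency detail bands. Subband naming conventions vary
    between libraries; the labels here are only illustrative.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0  # local average (low frequency)
    lh = (a + b - c - d) / 2.0  # difference between rows
    hl = (a - b + c - d) / 2.0  # difference between columns
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the full-resolution array exactly."""
    h, w = ll.shape
    x = np.empty((2 * h, 2 * w), dtype=ll.dtype)
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2.0
    x[0::2, 1::2] = (ll + lh - hl - hh) / 2.0
    x[1::2, 0::2] = (ll - lh + hl - hh) / 2.0
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2.0
    return x


# Hypothetical stand-ins for a ground-truth image and a pre-super-resolved image.
rng = np.random.default_rng(0)
hr = rng.random((8, 8))
pre_sr = hr + 0.05 * rng.standard_normal((8, 8))

# Residual map: the part of the image that the pre-SR estimate misses.
residual = hr - pre_sr

# Wavelet "downsampling" halves the resolution but keeps all information
# by splitting it across four subbands.
ll, lh, hl, hh = haar_dwt2(residual)
reconstructed = haar_idwt2(ll, lh, hl, hh)
roundtrip_ok = np.allclose(reconstructed, residual)
```

Unlike strided convolution or pooling, this decomposition discards nothing: the four half-resolution subbands together hold the same number of values as the input, which is why the inverse transform can recover it up to floating-point precision.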
