Table of Contents
Fetching ...

Frequency-domain Learning with Kernel Prior for Blind Image Deblurring

Jixiang Sun, Fei Lei, Jiawei Zhang, Wenxiu Sun, Yujiu Yang

TL;DR

This work tackles blind image deblurring by incorporating explicit blur kernel priors into a frequency-domain Transformer. It introduces a Kernel Estimation Module to predict per-pixel kernels and a Frequency Integration Module that fuses kernel information in the frequency domain through a novel Frequency Attention mechanism, augmented by a multi-scale encoder–decoder integration. The approach yields state-of-the-art results on GoPro, HIDE, and RealBlur benchmarks and demonstrates robust generalization to out-of-domain data, albeit with higher computational cost. Overall, the combination of kernel priors and frequency-domain fusion offers a practical path toward more generalizable deblurring models with explicit degradation information.

Abstract

While achieving excellent results on various datasets, many deep learning methods for image deblurring suffer from limited generalization capabilities with out-of-domain data. This limitation is likely caused by their dependence on certain domain-specific datasets. To address this challenge, we argue that it is necessary to introduce the kernel prior into deep learning methods, as the kernel prior remains independent of the image context. For effective fusion of kernel prior information, we adopt a rational implementation method inspired by traditional deblurring algorithms that perform deconvolution in the frequency domain. We propose a module called Frequency Integration Module (FIM) for fusing the kernel prior and combine it with a frequency-based deblurring Transfomer network. Experimental results demonstrate that our method outperforms state-of-the-art methods on multiple blind image deblurring tasks, showcasing robust generalization abilities. Source code will be available soon.

Frequency-domain Learning with Kernel Prior for Blind Image Deblurring

TL;DR

This work tackles blind image deblurring by incorporating explicit blur kernel priors into a frequency-domain Transformer. It introduces a Kernel Estimation Module to predict per-pixel kernels and a Frequency Integration Module that fuses kernel information in the frequency domain through a novel Frequency Attention mechanism, augmented by a multi-scale encoder–decoder integration. The approach yields state-of-the-art results on GoPro, HIDE, and RealBlur benchmarks and demonstrates robust generalization to out-of-domain data, albeit with higher computational cost. Overall, the combination of kernel priors and frequency-domain fusion offers a practical path toward more generalizable deblurring models with explicit degradation information.

Abstract

While achieving excellent results on various datasets, many deep learning methods for image deblurring suffer from limited generalization capabilities with out-of-domain data. This limitation is likely caused by their dependence on certain domain-specific datasets. To address this challenge, we argue that it is necessary to introduce the kernel prior into deep learning methods, as the kernel prior remains independent of the image context. For effective fusion of kernel prior information, we adopt a rational implementation method inspired by traditional deblurring algorithms that perform deconvolution in the frequency domain. We propose a module called Frequency Integration Module (FIM) for fusing the kernel prior and combine it with a frequency-based deblurring Transfomer network. Experimental results demonstrate that our method outperforms state-of-the-art methods on multiple blind image deblurring tasks, showcasing robust generalization abilities. Source code will be available soon.

Paper Structure

This paper contains 15 sections, 5 equations, 4 figures, 4 tables.

Figures (4)

  • Figure 1: Overview of the network architecture of our proposed method. (a) The structure of the Kernel Estimation Module. (b) The structure of our proposed Frequency Integration Module (FIM). (c) The structure of the full pipeline in our proposed method.
  • Figure 2: Visual comparison on GoPro nah2017deep dataset. We compare our method with MIMO-UNet+ cho2021rethinking, Stripformer tsai2022stripformer, Restormer-Local zamir2022restormer, NAFNet chen2022simple, and FFTFormer kong2023efficient. Models are trained only on the GoPro dataset. Our network generates more realistic images with clearer details.
  • Figure 3: Visual comparison on RealBlur rim2020real dataset. We compare our method with MIMO-UNet+ cho2021rethinking, NAFNet chen2022simple, Restormer-Local zamir2022restormer, Stripformer tsai2022stripformer, and FFTFormer kong2023efficient. Models are trained only on the GoPro dataset. Our network generates more realistic images with clearer details.
  • Figure 4: Visual comparison on HIDE shen2019human dataset. We compare our method with MIMO-UNet+ cho2021rethinking, NAFNet chen2022simple, Restormer-Local zamir2022restormer, Stripformer tsai2022stripformer, and FFTFormer kong2023efficient. Models are trained only on the GoPro dataset. Our network generates more realistic images with clearer details.