DeepRFTv2: Kernel-level Learning for Image Deblurring

Xintian Mao; Haofei Song; Yin-Nian Liu; Qingli Li; Yan Wang

DeepRFTv2: Kernel-level Learning for Image Deblurring

Xintian Mao, Haofei Song, Yin-Nian Liu, Qingli Li, Yan Wang

TL;DR

DeepRFTv2 addresses the kernel-level nature of blur by introducing Fourier Kernel Estimator (FKE) and coupling it with a Decoupled Multi-Scale UNet (DMS-UNet) to perform end-to-end kernel-level learning for deblurring. By operating in Fourier space, FKE converts convolution into a multiplicative operation on frequency features, enabling global kernel estimation without supervision and direct convolution with learned features. The DMS-UNet design incorporates reversible sub-units to enable efficient multi-scale processing and mitigate information aliasing, delivering strong empirical performance across motion and defocus blur benchmarks. The work demonstrates that kernel-level learning yields physically meaningful kernels and superior restoration quality, with potential applicability to other kernel-related image restoration tasks.

Abstract

It is well-known that if a network aims to learn how to deblur, it should understand the blur process. Blurring is naturally caused by the convolution of the sharp image with the blur kernel. Thus, allowing the network to learn the blur process in the kernel-level can significantly improve the image deblurring performance. But, current deep networks are still at the pixel-level learning stage, either performing end-to-end pixel-level restoration or stage-wise pseudo kernel-level restoration, failing to enable the deblur model to understand the essence of the blur. To this end, we propose Fourier Kernel Estimator (FKE), which considers the activation operation in Fourier space and converts the convolution problem in the spatial domain to a multiplication problem in Fourier space. Our FKE, jointly optimized with the deblur model, enables the network to learn the kernel-level blur process with low complexity and without any additional supervision. Furthermore, we change the convolution object of the kernel from ``image" to network extracted ``feature", whose rich semantic and structural information is more suitable to blur process learning. With the convolution of the feature and the estimated kernel, our model can learn the essence of blur in kernel-level. To further improve the efficiency of feature extraction, we design a decoupled multi-scale architecture with multiple hierarchical sub-unets with a reversible strategy, which allows better multi-scale encoding and decoding in low training memory. Extensive experiments indicate that our method achieves state-of-the-art motion deblurring results and show potential for handling other kernel-related problems. Analysis also shows our kernel estimator is able to learn physically meaningful kernels. The code will be available at https://github.com/DeepMed-Lab-ECNU/Single-Image-Deblur.

DeepRFTv2: Kernel-level Learning for Image Deblurring

TL;DR

Abstract

DeepRFTv2: Kernel-level Learning for Image Deblurring

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)