Spatial and Frequency Domain Adaptive Fusion Network for Image Deblurring
Hu Gao, Depeng Dang
TL;DR
The paper tackles image deblurring by proposing SFAFNet, a dual-domain architecture that fuses spatial and frequency information through a gated spatial-frequency feature fusion block (GSFFBlock). The GSFFBlock combines a spatial domain module (NAFBlock-based), a learnable, row-wise FDGM low-pass filter to create adaptive frequency subbands, and a gated fusion module (GFM) with gating (GATE) and cross-attention (CAM) to integrate features. Extensive experiments on GoPro, RealBlur, DPDD, and HIDE show state-of-the-art results in motion and defocus deblurring, with comprehensive ablations confirming the contributions of FDGM, GFM, loss terms, and design choices. The work demonstrates that learning to adaptively separate and fuse spatial and frequency information yields superior restoration performance while offering improved efficiency, highlighting a practical path for robust real-world deblurring.
Abstract
Image deblurring aims to reconstruct a latent sharp image from its corresponding blurred one. Although existing methods have achieved good performance, most of them operate exclusively in either the spatial domain or the frequency domain, rarely exploring solutions that fuse both domains. In this paper, we propose a spatial-frequency domain adaptive fusion network (SFAFNet) to address this limitation. Specifically, we design a gated spatial-frequency domain feature fusion block (GSFFBlock), which consists of three key components: a spatial domain information module, a frequency domain information dynamic generation module (FDGM), and a gated fusion module (GFM). The spatial domain information module employs the NAFBlock to integrate local information. Meanwhile, in the FDGM, we design a learnable low-pass filter that dynamically decomposes features into separate frequency subbands, capturing the image-wide receptive field and enabling the adaptive exploration of global contextual information. Additionally, to facilitate information flow and the learning of complementary representations. In the GFM, we present a gating mechanism (GATE) to re-weight spatial and frequency domain features, which are then fused through the cross-attention mechanism (CAM). Experimental results demonstrate that our SFAFNet performs favorably compared to state-of-the-art approaches on commonly used benchmarks.
