Emphasizing Crucial Features for Efficient Image Restoration
Hu Gao, Bowen Ma, Ying Zhang, Jingfan Yang, Jing Yang, Depeng Dang
TL;DR
This work tackles image restoration when degradation varies across regions by introducing ECFNet, a CNN-based U-Net variant that integrates a spatial-frequency attention mechanism (SFAM) and a three-branch multi-scale block (MSBlock). SFAM decomposes into SDAM, which locates degradation through spatial and channel cues, and FDAM, which amplifies high-frequency information to highlight spectral differences between sharp and degraded regions; MSBlock enables global dependencies at high resolution, while subsequent scales use MSSFBlocks for efficiency. The model is trained with a multi-term loss that combines reconstruction, edge, and frequency-domain terms, and is validated across three tasks—image dehazing, defocus deblurring, and desnowing—showing state-of-the-art performance with substantial efficiency gains on both synthetic and real-world datasets. The findings demonstrate the practical potential of region-aware restoration and multi-scale CNN architectures for robust, real-time image restoration in diverse environments.
Abstract
Image restoration is a challenging ill-posed problem which estimates latent sharp image from its degraded counterpart. Although the existing methods have achieved promising performance by designing novelty architecture of module, they ignore the fact that different regions in a corrupted image undergo varying degrees of degradation. In this paper, we propose an efficient and effective framework to adapt to varying degrees of degradation across different regions for image restoration. Specifically, we design a spatial and frequency attention mechanism (SFAM) to emphasize crucial features for restoration. SFAM consists of two modules: the spatial domain attention module (SDAM) and the frequency domain attention module (FDAM). The SFAM discerns the degradation location through spatial selective attention and channel selective attention in the spatial domain, while the FDAM enhances high-frequency signals to amplify the disparities between sharp and degraded image pairs in the spectral domain. Additionally, to capture global range information, we introduce a multi-scale block (MSBlock) that consists of three scale branches, each containing multiple simplified channel attention blocks (SCABlocks) and a multi-scale feed-forward block (MSFBlock). Finally, we propose our ECFNet, which integrates the aforementioned components into a U-shaped backbone for recovering high-quality images. Extensive experimental results demonstrate the effectiveness of ECFNet, outperforming state-of-the-art (SOTA) methods on both synthetic and real-world datasets.
