ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer
Wei Dong, Han Zhou, Yuqiong Tian, Jingke Sun, Xiaohong Liu, Guangtao Zhai, Jun Chen
TL;DR
ShadowRefiner targets mask-free shadow removal by combining a ConvNext-based Shadow Removal module that leverages spatial and frequency representations with a novel Fast Fourier Attention Transformer (FFAT) in a Refinement module to enhance texture and color consistency. The two-stage architecture first removes shadows and then refines details via FFT-based attention, achieving state-of-the-art results on NTIRE 2024 (Perceptual Track winner, Fidelity Track runner-up) and strong performance on ISTD/ISTD+/WSRD+ datasets. Ablation confirms the FFAT refinement's crucial role in improving fidelity and texture while preserving colors, with extensive experiments supporting effectiveness over mask-free baselines and competitive performance against mask-based methods. The work demonstrates the value of integrating spatial-frequency analysis and FFT-domain attention in shadow removal, offering practical impact for real-world scenes and downstream vision tasks.
Abstract
Shadow-affected images often exhibit pronounced spatial discrepancies in color and illumination, consequently degrading various vision applications including object detection and segmentation systems. To effectively eliminate shadows in real-world images while preserving intricate details and producing visually compelling outcomes, we introduce a mask-free Shadow Removal and Refinement network (ShadowRefiner) via Fast Fourier Transformer. Specifically, the Shadow Removal module in our method aims to establish effective mappings between shadow-affected and shadow-free images via spatial and frequency representation learning. To mitigate the pixel misalignment and further improve the image quality, we propose a novel Fast-Fourier Attention based Transformer (FFAT) architecture, where an innovative attention mechanism is designed for meticulous refinement. Our method wins the championship in the Perceptual Track and achieves the second best performance in the Fidelity Track of NTIRE 2024 Image Shadow Removal Challenge. Besides, comprehensive experiment result also demonstrate the compelling effectiveness of our proposed method. The code is publicly available: https://github.com/movingforward100/Shadow_R.
