Retinex-guided Histogram Transformer for Mask-free Shadow Removal
Wei Dong, Han Zhou, Seyed Amirreza Mousavi, Jun Chen
TL;DR
The paper tackles shadow removal without relying on shadow masks, addressing generalization gaps in mask-based approaches. It introduces ReHiT, a dual-branch Retinex-based pipeline with an Illumination-Guided Hybrid CNN-Transformer (IG-HCT) and an Illumination-Guided Histogram Transformer Block (IG-HTB) to handle non-uniform illumination. The approach jointly models reflectance and illumination, using DRDB and SAM blocks for multi-scale fusion and a histogram self-attention mechanism guided by illumination. Experiments on ISTD, ISTD+, WSRD+ and the NTIRE 2025 Shadow Removal Challenge show competitive performance with much smaller parameter counts and faster inference, highlighting practical efficiency for real-world deployment.
Abstract
While deep learning methods have achieved notable progress in shadow removal, many existing approaches rely on shadow masks that are difficult to obtain, limiting their generalization to real-world scenes. In this work, we propose ReHiT, an efficient mask-free shadow removal framework based on a hybrid CNN-Transformer architecture guided by Retinex theory. We first introduce a dual-branch pipeline to separately model reflectance and illumination components, and each is restored by our developed Illumination-Guided Hybrid CNN-Transformer (IG-HCT) module. Second, besides the CNN-based blocks that are capable of learning residual dense features and performing multi-scale semantic fusion, multi-scale semantic fusion, we develop the Illumination-Guided Histogram Transformer Block (IGHB) to effectively handle non-uniform illumination and spatially complex shadows. Extensive experiments on several benchmark datasets validate the effectiveness of our approach over existing mask-free methods. Trained solely on the NTIRE 2025 Shadow Removal Challenge dataset, our solution delivers competitive results with one of the smallest parameter sizes and fastest inference speeds among top-ranked entries, highlighting its applicability for real-world applications with limited computational resources. The code is available at https://github.com/dongw22/oath.
