NTIRE 2023 Image Shadow Removal Challenge Technical Report: Team IIM_TTI
Yuki Kondo, Riku Miyata, Fuma Yasue, Taito Naruki, Norimichi Ukita
TL;DR
The paper analyzes ShadowFormer for NTIRE 2023 Shadow Removal and introduces five enhancements to tackle misalignment and shadow-mask scarcity: homography-based image alignment, perceptual quality losses, SASMA for semi-automatic shadow masking, joint detection–removal training, and CutShadow augmentation. The integrated pipeline emphasizes perceptual fidelity and contextual consistency, achieving competitive metrics ($0.196$ LPIPS and $7.44$ MOS) while addressing practical misalignment seen in real-world data. Although PSNR/SIM metrics may drop on misaligned GT, the approach improves shadow removal quality and scene structure preservation, illustrating robust performance under challenging conditions. These contributions advance shadow removal in realistically misaligned data and offer practical strategies for perceptual optimization and data-efficient learning.
Abstract
In this paper, we analyze and discuss ShadowFormer in preparation for the NTIRE2023 Shadow Removal Challenge [1], implementing five key improvements: image alignment, the introduction of a perceptual quality loss function, the semi-automatic annotation for shadow detection, joint learning of shadow detection and removal, and the introduction of new data augmentation technique "CutShadow" for shadow removal. Our method achieved scores of 0.196 (3rd out of 19) in LPIPS and 7.44 (4th out of 19) in the Mean Opinion Score (MOS).
