Improving Image De-raining Using Reference-Guided Transformers
Zihao Ye, Jaehoon Cho, Changjae Oh
TL;DR
The paper tackles single-image de-raining by introducing a reference-guided de-raining filter (RDF) that augments existing de-raining models with a reference clean image. RDF comprises a feature extractor, a feature attention module, and a feature fusion module to transfer useful features from the reference $R_c$ to the baseline derained output $ hat{X}_c$, guided by cross-scale attention and fusion. A two-stage training strategy uses an $L_1$ loss for initialization and a MS-SSIM-L1 loss for fine-tuning, with $\alpha_1=0.6$ and $\alpha_2=0.4$, enabling robust feature transfer. Experiments on BDD100K-Rain, Cityscapes-Rain, and KITTI-Rain show consistent improvements for GMM, PReNet, and Uformer baselines, demonstrating RDF's plug-and-play applicability and potential to improve real-world outdoor vision systems.
Abstract
Image de-raining is a critical task in computer vision to improve visibility and enhance the robustness of outdoor vision systems. While recent advances in de-raining methods have achieved remarkable performance, the challenge remains to produce high-quality and visually pleasing de-rained results. In this paper, we present a reference-guided de-raining filter, a transformer network that enhances de-raining results using a reference clean image as guidance. We leverage the capabilities of the proposed module to further refine the images de-rained by existing methods. We validate our method on three datasets and show that our module can improve the performance of existing prior-based, CNN-based, and transformer-based approaches.
