Dual-domain Adaptation Networks for Realistic Image Super-resolution
Chaowei Fang, Bolin Fu, De Cheng, Lechao Cheng, Guanbin Li
TL;DR
This paper tackles realistic image super-resolution by bridging the gap between models trained on synthetic data and real-world degradations. It introduces Dual-domain Adaptation Networks (DAN), which combine a Spatial-Domain Adaptation (SDA) strategy with a Frequency-Domain Adaptation (FDA) branch to adapt pre-trained SR backbones (e.g., SwinIR) to real LR-HR pairs. SDA uses selective parameter updating with low-rank adapters to preserve low-level features while adapting to real data, and FDA merges FFT-based spectral information with backbone features to recover high-frequency details. Extensive experiments on RealSR, D2CRealSR, and DRealSR demonstrate state-of-the-art performance with far fewer trainable parameters than full fine-tuning, including robust cross-camera adaptation; ablation analyses validate the importance of FDA, SDA, and LoRA components. The work advances practical SR by enabling efficient transfer from simulated to realistic domains, with potential impact on surveillance, medical imaging, and consumer electronics where real degradations are prevalent.
Abstract
Realistic image super-resolution (SR) focuses on transforming real-world low-resolution (LR) images into high-resolution (HR) ones, handling more complex degradation patterns than synthetic SR tasks. This is critical for applications like surveillance, medical imaging, and consumer electronics. However, current methods struggle with limited real-world LR-HR data, impacting the learning of basic image features. Pre-trained SR models from large-scale synthetic datasets offer valuable prior knowledge, which can improve generalization, speed up training, and reduce the need for extensive real-world data in realistic SR tasks. In this paper, we introduce a novel approach, Dual-domain Adaptation Networks, which is able to efficiently adapt pre-trained image SR models from simulated to real-world datasets. To achieve this target, we first set up a spatial-domain adaptation strategy through selectively updating parameters of pre-trained models and employing the low-rank adaptation technique to adjust frozen parameters. Recognizing that image super-resolution involves recovering high-frequency components, we further integrate a frequency domain adaptation branch into the adapted model, which combines the spectral data of the input and the spatial-domain backbone's intermediate features to infer HR frequency maps, enhancing the SR result. Experimental evaluations on public realistic image SR benchmarks, including RealSR, D2CRealSR, and DRealSR, demonstrate the superiority of our proposed method over existing state-of-the-art models. Codes are available at: https://github.com/dummerchen/DAN.
