Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution
Yuehan Zhang, Seungjun Lee, Angela Yao
TL;DR
This work tackles unsupervised real-world image super-resolution under unknown degradations by introducing Pairwise Distance Distillation (PDD), which jointly leverages a specialist trained on synthetic degradations and a generalist trained on broader degradations. PDD enforces two forms of distance consistency in $VGG$ feature space: intra-model distances between a model's predictions on real-world and synthetic inputs, and inter-model distances between the specialist and generalist across domains, using both feature and Gram-matrix statistics. The method optimizes a hybrid loss combining supervised terms on synthetic data and unsupervised distillation terms on real-world data, with two initialization schemes (static and EMA) that affect performance. Empirical results on RealSR, DRealSR, and NTIRE20 show that PDD improves fidelity and perceptual quality, often outperforming state-of-the-art RWSR methods, and are corroborated by user studies; code is provided for reproducibility.
Abstract
Standard single-image super-resolution creates paired training data from high-resolution images through fixed downsampling kernels. However, real-world super-resolution (RWSR) faces unknown degradations in the low-resolution inputs, all the while lacking paired training data. Existing methods approach this problem by learning blind general models through complex synthetic augmentations on training inputs; they sacrifice the performance on specific degradation for broader generalization to many possible ones. We address the unsupervised RWSR for a targeted real-world degradation. We study from a distillation perspective and introduce a novel pairwise distance distillation framework. Through our framework, a model specialized in synthetic degradation adapts to target real-world degradations by distilling intra- and inter-model distances across the specialized model and an auxiliary generalized model. Experiments on diverse datasets demonstrate that our method significantly enhances fidelity and perceptual quality, surpassing state-of-the-art approaches in RWSR. The source code is available at https://github.com/Yuehan717/PDD.
