Scale-Equivariant Imaging: Self-Supervised Learning for Image Super-Resolution and Deblurring
Jérémy Scanvic, Mike Davies, Patrice Abry, Julián Tachella
TL;DR
This work tackles image super-resolution and deblurring in settings where ground-truth high-resolution images are unavailable by introducing scale-equivariant imaging (SEI), a self-supervised framework that leverages scale invariance to recover high-frequency content lost in bandlimited measurements. SEI combines Stein's unbiased risk estimator (SURE) with a scale-equivariant loss (L_SEQ) that uses downscaled reconstructions and gradient stopping to create high-frequency targets without relying on clean references. Theoretically, the authors show that scale transformations enable identifiability of high-frequency information from bandlimited data, a property not shared by roto-translation invariances. Empirically, SEI matches fully supervised performance and outperforms other self-supervised methods across diverse degradations and image distributions, including medical CT data, with promising prospects for fine-tuning and blind-imaging extensions.
Abstract
Self-supervised methods have recently proved to be nearly as effective as supervised ones in various imaging inverse problems, paving the way for learning-based approaches in scientific and medical imaging applications where ground truth data is hard or expensive to obtain. These methods critically rely on invariance to translations and/or rotations of the image distribution to learn from incomplete measurement data alone. However, existing approaches fail to obtain competitive performances in the problems of image super-resolution and deblurring, which play a key role in most imaging systems. In this work, we show that invariance to roto-translations is insufficient to learn from measurements that only contain low-frequency information. Instead, we propose scale-equivariant imaging, a new self-supervised approach that leverages the fact that many image distributions are approximately scale-invariant, enabling the recovery of high-frequency information lost in the measurement process. We demonstrate throughout a series of experiments on real datasets that the proposed method outperforms other self-supervised approaches, and obtains performances on par with fully supervised learning.
