Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures
Arkaprabha Basu, Kushal Bose, Sankha Subhra Mullick, Anish Chakrabarty, Swagatam Das
TL;DR
This paper addresses 4x image super-resolution by introducing SuRGe, a fully-convolutional GAN that preserves and adaptive-mixes features from multiple depths of the generator. It uniquely integrates Jensen–Shannon divergence $L^{G}_{JS}$ and Gromov-Wasserstein distance $L^{G}_{GW}$ as auxiliary objectives to align SR with HR and with LR–SR distributions, while the discriminator is trained with Wasserstein loss and gradient penalty to curb mode collapse. The generator loss is dynamically formed as a Softmax-weighted combination of adversarial, JS, and GW terms, enabling balanced optimization, and the architecture employs learnable convex feature mixing $F_{0}$ and $F_{1}$ with two-stage 2x upscaling and nearest-neighbor upsampling to reduce artifacts. Empirical results on DIV2K and 10 benchmarks demonstrate state-of-the-art PSNR/SSIM improvements across diverse datasets, with favorable inference time and parameter efficiency. The work highlights the potential of explicit distributional divergences in guiding SR and suggests avenues for robustness and extension to other scaling factors.
Abstract
Super-Resolution (SR) is a time-hallowed image processing problem that aims to improve the quality of a Low-Resolution (LR) sample up to the standard of its High-Resolution (HR) counterpart. We aim to address this by introducing Super-Resolution Generator (SuRGe), a fully-convolutional Generative Adversarial Network (GAN)-based architecture for SR. We show that distinct convolutional features obtained at increasing depths of a GAN generator can be optimally combined by a set of learnable convex weights to improve the quality of generated SR samples. In the process, we employ the Jensen-Shannon and the Gromov-Wasserstein losses respectively between the SR-HR and LR-SR pairs of distributions to further aid the generator of SuRGe to better exploit the available information in an attempt to improve SR. Moreover, we train the discriminator of SuRGe with the Wasserstein loss with gradient penalty, to primarily prevent mode collapse. The proposed SuRGe, as an end-to-end GAN workflow tailor-made for super-resolution, offers improved performance while maintaining low inference time. The efficacy of SuRGe is substantiated by its superior performance compared to 18 state-of-the-art contenders on 10 benchmark datasets.
