Augmenting Perceptual Super-Resolution via Image Quality Predictors

Fengjia Zhang; Samrudhdhi B. Rangrej; Tristan Aumentado-Armstrong; Afsaneh Fazly; Alex Levinshtein

Augmenting Perceptual Super-Resolution via Image Quality Predictors

Fengjia Zhang, Samrudhdhi B. Rangrej, Tristan Aumentado-Armstrong, Afsaneh Fazly, Alex Levinshtein

TL;DR

This work tackles the ill-posed nature of single-image super-resolution by leveraging no-reference IQA predictors to guide training, either through IQA-weighted sampling of multiple enhanced ground-truths or through differentiable optimization of image quality. By systematically analyzing NR-IQA metrics on SBS180K and HGGT, the authors identify MUSIQ (and complementary metrics like NIMA and Q-Align) as robust signals and implement two NR-IQA-based strategies: reweighted GT sampling (SMA, SMP, AMO) and direct optimization with regularization via LoRA. The combination of Argmax-online sampling and NR-IQA-guided fine-tuning (AMO+FT) achieves state-of-the-art perceptual-quality SR without human annotations, outperforming human-guided positives-only baselines on NR metrics and receiving favorable user-study preferences. The results demonstrate a scalable path to enhancing perceptual SR quality through existing NR-IQA models, with practical implications for real-world SR under domain shift and subjective quality assessments.

Abstract

Super-resolution (SR), a classical inverse problem in computer vision, is inherently ill-posed, inducing a distribution of plausible solutions for every input. However, the desired result is not simply the expectation of this distribution, which is the blurry image obtained by minimizing pixelwise error, but rather the sample with the highest image quality. A variety of techniques, from perceptual metrics to adversarial losses, are employed to this end. In this work, we explore an alternative: utilizing powerful non-reference image quality assessment (NR-IQA) models in the SR context. We begin with a comprehensive analysis of NR-IQA metrics on human-derived SR data, identifying both the accuracy (human alignment) and complementarity of different metrics. Then, we explore two methods of applying NR-IQA models to SR learning: (i) altering data sampling, by building on an existing multi-ground-truth SR framework, and (ii) directly optimizing a differentiable quality score. Our results demonstrate a more human-centric perception-distortion tradeoff, focusing less on non-perceptual pixel-wise distortion, instead improving the balance between perceptual fidelity and human-tuned NR-IQA measures.

Augmenting Perceptual Super-Resolution via Image Quality Predictors

TL;DR

Abstract

Augmenting Perceptual Super-Resolution via Image Quality Predictors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)