Can No-Reference Quality-Assessment Methods Serve as Perceptual Losses for Super-Resolution?
Egor Kashkarov, Egor Chistov, Ivan Molodetskikh, Dmitriy Vatolin
TL;DR
The paper investigates whether no-reference IQA methods can serve as perceptual losses for video super-resolution. By evaluating multiple VSR architectures and a broad set of NR-IQA losses (with comparisons to LPIPS and PieAPP), the study shows that naive optimization often introduces artifacts and can destabilize some IQA signals, while a carefully designed training procedure that combines NR-IQA with LPIPS can mitigate these issues. The findings indicate that NR-IQA losses alone are not reliably beneficial and that cross-IQA interactions must be considered; nonetheless, some NR-IQA methods (e.g., PaQ-2-PiQ, CLIP-IQA) show artifact-free behavior in certain configurations. The work highlights practical implications for designing perceptual losses in VSR and suggests directions for more robust loss-compositions and gradient-based weight selection.
Abstract
Perceptual losses play an important role in constructing deep-neural-network-based methods by increasing the naturalness and realism of processed images and videos. Use of perceptual losses is often limited to LPIPS, a fullreference method. Even though deep no-reference image-qualityassessment methods are excellent at predicting human judgment, little research has examined their incorporation in loss functions. This paper investigates direct optimization of several video-superresolution models using no-reference image-quality-assessment methods as perceptual losses. Our experimental results show that straightforward optimization of these methods produce artifacts, but a special training procedure can mitigate them.
