Adversarial purification for no-reference image-quality metrics: applicability study and new methods
Aleksandr Gushchin, Anna Chistyakova, Vladislav Minashkin, Anastasia Antsiferova, Dmitriy Vatolin
TL;DR
The paper addresses the vulnerability of no-reference image quality assessment (NR-IQA) metrics to adversarial perturbations and explores whether purification defenses developed for image classifiers can defend NR-IQA. It adapts 10 attack methods to NR-IQA, introduces 16 purification techniques including diffusion-based methods (DiffPure) and a novel FCN filter, and constructs an adversarial dataset based on the NIPS 2017 benchmark to evaluate defenses against three NR-IQA metrics, primarily Linearity. Key findings show that diffusion-based purifications, especially DiffPure with optional unsharp masking, achieve high image quality and preserve correlation with subjective quality, while simple transformations like rotation or flipping can be surprisingly effective; the FCN filter offers strong defense against colour-based attacks (AdvCF). The work demonstrates transferability of purification strategies to NR-IQA, provides a benchmark dataset for adversarial NR-IQA, and suggests directions toward provable defenses for robust IQA metrics with practical impact on benchmarks and optimization tasks in vision systems.
Abstract
Recently, the area of adversarial attacks on image quality metrics has begun to be explored, whereas the area of defences remains under-researched. In this study, we aim to cover that case and check the transferability of adversarial purification defences from image classifiers to IQA methods. In this paper, we apply several widespread attacks on IQA models and examine the success of the defences against them. The purification methodologies covered different preprocessing techniques, including geometrical transformations, compression, denoising, and modern neural network-based methods. Also, we address the challenge of assessing the efficacy of a defensive methodology by proposing ways to estimate output visual quality and the success of neutralizing attacks. Defences were tested against attack on three IQA metrics -- Linearity, MetaIQA and SPAQ. The code for attacks and defences is available at: (link is hidden for a blind review).
