SEGA: A Transferable Signed Ensemble Gaussian Black-Box Attack against No-Reference Image Quality Assessment Models
Yujia Liu, Dingquan Li, Zhixuan Li, Tiejun Huang
TL;DR
This work addresses the vulnerability of No-Reference Image Quality Assessment (NR-IQA) models to adversarial perturbations, focusing on cross-model transferability in black-box settings. It introduces SEGA, a transferable black-box attack that applies Gaussian smoothing to multiple source NR-IQA models and ensembles their smoothed gradients to approximate the target model's gradient, then applies a perturbation filter to keep the perturbation imperceptible. The authors prove an upper bound on the gradient-approximation error, $\|\hat{g}(x)-\nabla h(x)\| \le (L\sigma + C/\sigma)\sqrt{2}\frac{\Gamma((d+1)/2)}{\Gamma(d/2)}$, and empirically validate SEGA on CLIVE and KonIQ-10k, showing strong transferability across CNN- and transformer-based NR-IQA models. The results show that SEGA degrades ranking and score consistency while maintaining high perceptual similarity (SSIM around 0.85–0.9), which makes it useful for robustness evaluation and for guiding defenses in NR-IQA systems.
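To get a feel for the stated bound, the short sketch below evaluates its right-hand side numerically. It rests on assumptions, because the summary does not define the symbols: $L$ and $C$ are treated as the two constants in the bound, $\sigma$ as the Gaussian smoothing scale, and $d$ as the input dimension. The function names are hypothetical. The factor $\sqrt{2}\,\Gamma((d+1)/2)/\Gamma(d/2)$ equals the expected Euclidean norm of a standard Gaussian vector in $d$ dimensions, so it grows like $\sqrt{d}$, and the term $L\sigma + C/\sigma$ is minimized at $\sigma = \sqrt{C/L}$.

```python
import math


def gaussian_norm_factor(d):
    """sqrt(2) * Gamma((d+1)/2) / Gamma(d/2), computed via log-gamma for stability."""
    return math.sqrt(2.0) * math.exp(math.lgamma((d + 1) / 2) - math.lgamma(d / 2))


def error_bound(L, C, sigma, d):
    """Right-hand side of the bound: (L*sigma + C/sigma) * gaussian_norm_factor(d)."""
    return (L * sigma + C / sigma) * gaussian_norm_factor(d)


def best_sigma(L, C):
    """Minimizer of L*sigma + C/sigma over sigma > 0 (set the derivative to zero)."""
    return math.sqrt(C / L)


if __name__ == "__main__":
    d = 224 * 224 * 3  # illustrative input dimension, not taken from the paper
    L, C = 2.0, 8.0    # illustrative constants, not taken from the paper
    for sigma in (0.5, best_sigma(L, C), 8.0):
        print(f"sigma={sigma:.2f}  bound={error_bound(L, C, sigma, d):.2f}")
```

The bound therefore trades off two effects: a large $\sigma$ inflates the $L\sigma$ term, while a small $\sigma$ inflates $C/\sigma$, and in either case the bound scales roughly with $\sqrt{d}$.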
Abstract
No-Reference Image Quality Assessment (NR-IQA) models play an important role in various real-world applications. Recently, adversarial attacks against NR-IQA models have attracted increasing attention, as they provide valuable insights for revealing model vulnerabilities and guiding robust system design. Some effective attacks have been proposed against NR-IQA models in white-box settings, where the attacker has full access to the target model. However, these attacks often suffer from poor transferability to unknown target models in more realistic black-box scenarios, where the target model is inaccessible. This work makes the first attempt to address the challenge of low transferability in attacking NR-IQA models by proposing a transferable Signed Ensemble Gaussian black-box Attack (SEGA). The main idea is to approximate the gradient of the target model by applying Gaussian smoothing to source models and ensembling their smoothed gradients. To ensure the imperceptibility of adversarial perturbations, SEGA further removes inappropriate perturbations using a specially designed perturbation filter mask. Experimental results on the CLIVE dataset demonstrate the superior transferability of SEGA, validating its effectiveness in enabling successful transfer-based black-box attacks against NR-IQA models.
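The main idea described above (smooth each source model with Gaussian noise, ensemble the smoothed gradients, then take a signed step) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the Monte Carlo estimator, the uniform ensemble weights, the single-step update, and the toy gradient functions are all assumptions made for illustration, and the perturbation filter mask is omitted because the abstract does not specify it.

```python
import numpy as np


def smoothed_gradient(grad_fn, x, sigma, n_samples, rng):
    """Monte Carlo estimate of the gradient of a Gaussian-smoothed model.

    Averages grad_fn over points x + sigma * u with u ~ N(0, I).
    """
    noise = rng.standard_normal((n_samples,) + x.shape)
    grads = np.stack([grad_fn(x + sigma * u) for u in noise])
    return grads.mean(axis=0)


def signed_ensemble_step(grad_fns, x, eps, sigma=0.05, n_samples=16,
                         weights=None, direction=-1.0, seed=0):
    """One signed step along the ensembled, smoothed source gradients.

    direction=-1.0 pushes the predicted quality score down, +1.0 pushes it up.
    Pixel values are assumed to lie in [0, 1].
    """
    rng = np.random.default_rng(seed)
    if weights is None:
        # Assumption: uniform weights over the source models.
        weights = np.full(len(grad_fns), 1.0 / len(grad_fns))
    g = sum(w * smoothed_gradient(f, x, sigma, n_samples, rng)
            for w, f in zip(weights, grad_fns))
    x_adv = x + direction * eps * np.sign(g)
    return np.clip(x_adv, 0.0, 1.0)


if __name__ == "__main__":
    # Hypothetical stand-ins for the score gradients of two source NR-IQA models.
    toy_grad_fns = [lambda z: 2.0 * z, lambda z: 3.0 * z + 1.0]
    x = np.full((4, 4), 0.5)
    x_adv = signed_ensemble_step(toy_grad_fns, x, eps=0.03)
    print(np.abs(x_adv - x).max())
```

In practice each `grad_fn` would wrap backpropagation through a real source model, and taking the sign of the ensembled gradient keeps the perturbation within an $\ell_\infty$ budget of `eps` per pixel.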
