Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement
Kang Xiao, Xu Wang, Yulin He, Baoliang Chen, Xuelin Shen
TL;DR
This work tackles misalignment between FR-IQA predictions and human perception by introducing a training-free attention mechanism, SMIC, that estimates inter-image statistical dependency in deep feature space. SMIC uses sliced mutual information with random projections to generate attention maps that weight local distortion cues from existing IQA measures, improving their alignment with human judgments. The approach is validated across six IQA datasets and multiple baselines (including GAN- and SR-based distortions), demonstrating robust, model-agnostic gains without any training. The method is practical and interpretable, offering a scalable path to more MOS-consistent image quality assessment.
Abstract
Full-reference image quality assessment (FR-IQA) models generally operate by measuring the visual differences between a degraded image and its reference. However, existing FR-IQA models including both the classical ones (eg, PSNR and SSIM) and deep-learning based measures (eg, LPIPS and DISTS) still exhibit limitations in capturing the full perception characteristics of the human visual system (HVS). In this paper, instead of designing a new FR-IQA measure, we aim to explore a generalized human visual attention estimation strategy to mimic the process of human quality rating and enhance existing IQA models. In particular, we model human attention generation by measuring the statistical dependency between the degraded image and the reference image. The dependency is captured in a training-free manner by our proposed sliced maximal information coefficient and exhibits surprising generalization in different IQA measures. Experimental results verify the performance of existing IQA models can be consistently improved when our attention module is incorporated. The source code is available at https://github.com/KANGX99/SMIC.
