Image Statistics Predict the Sensitivity of Perceptual Quality Metrics
Alexander Hepburn, Valero Laparra, Raúl Santos-Rodriguez, Jesús Malo
TL;DR
This work directly links perceptual sensitivity of image-quality metrics to image statistics by estimating natural-image probabilities with a modern generative model (PixelCNN++). It finds that simple, probability-based factors—most notably the log-probability of the distorted image $\log(p(\tilde{\mathbf{x}}))$ and the original image’s standard deviation $\sigma(\mathbf{x})$—can predict metric sensitivity with correlations up to $\rho \approx 0.77$, approaching human-like performance in some evaluations. The authors validate the approach on natural-image psychophysics, reproduce several classical psychophysical trends (Weber law, CSF, masking), and show a non-parametric model can reach $\rho \approx 0.85$, with a simple interpretable form achieving $\rho \approx 0.77$. The results provide direct, quantitative support for a probabilistic, information-theoretic view of vision and offer a practical framework for predicting perceptual sensitivity from image statistics, despite limitations related to dataset scope and synthetic-stimulus generalization.
Abstract
Previously, Barlow and Attneave hypothesised a link between biological vision and information maximisation. Following Shannon, information was defined using the probability of natural images. Several physiological and psychophysical phenomena have been derived from principles like info-max, efficient coding, or optimal denoising. However, it remains unclear how this link is expressed in mathematical terms from image probability. Classical derivations were subjected to strong assumptions on the probability models and on the behaviour of the sensors. Moreover, the direct evaluation of the hypothesis was limited by the inability of classical image models to deliver accurate estimates of the probability. Here, we directly evaluate image probabilities using a generative model for natural images, and analyse how probability-related factors can be combined to predict the sensitivity of state-of-the-art subjective image quality metrics, a proxy for human perception. We use information theory and regression analysis to find a simple model that when combining just two probability-related factors achieves 0.77 correlation with subjective metrics. This probability-based model is validated in two ways: through direct comparison with the opinion of real observers in a subjective quality experiment, and by reproducing basic trends of classical psychophysical facts such as the Contrast Sensitivity Function, the Weber-law, and contrast masking.
