Generalized Portrait Quality Assessment
Nicolas Chahine, Sira Ferradans, Javier Vazquez-Corral, Jean Ponce
TL;DR
This paper addresses portrait quality assessment (PQA) under diverse real-world scenes. It introduces FULL-HyperIQA (FHIQA), a semantics-aware BIQA model that uses a scene-prediction vector to perform scene-conditioned rescaling of patch-level quality with the final score $Q_f = ( \sum_{i=1}^{k} P_{s_i} (a^{s}_{i} Q_p + b^{s}_{i}) ) / ( \sum_{j=1}^{k} P_{s_j} )$. It extends prior HyperIQA with a full-scene prediction for handling unseen content and shows competitive or superior performance on PIQ23 across multiple attributes, especially overall portrait quality. This demonstrates the value of semantic understanding for robust, generalizable IQA in smartphone photography and proposes a path toward content-specific evaluation.
Abstract
Automated and robust portrait quality assessment (PQA) is of paramount importance in high-impact applications such as smartphone photography. This paper presents FHIQA, a learning-based approach to PQA that introduces a simple but effective quality score rescaling method based on image semantics, to enhance the precision of fine-grained image quality metrics while ensuring robust generalization to various scene settings beyond the training dataset. The proposed approach is validated by extensive experiments on the PIQ23 benchmark and comparisons with the current state of the art. The source code of FHIQA will be made publicly available on the PIQ23 GitHub repository at https://github.com/DXOMARK-Research/PIQ2023.
