QUASAR: QUality and Aesthetics Scoring with Advanced Representations
Sergey Kastryulin, Denis Prokopenko, Artem Babenko, Dmitry V. Dylov
TL;DR
This work tackles the need for generalizable image quality and aesthetics assessment without prompt engineering. It introduces QUASAR, a data-driven non-parametric framework built from Anchor Data, an Image Encoder, and an Aggregation Function to produce a unified score. Across 8 benchmarks and 7 self-supervised models, QUASAR outperforms CLIP-IQA and demonstrates robustness to data preprocessing and anchor choice. The approach yields high agreement with human judgments, even with limited data, and offers a scalable, universal tool for evaluating both technical quality and aesthetics of visual content.
Abstract
This paper introduces a new data-driven, non-parametric method for image quality and aesthetics assessment, surpassing existing approaches and requiring no prompt engineering or fine-tuning. We eliminate the need for expressive textual embeddings by proposing efficient image anchors in the data. Through extensive evaluations of 7 state-of-the-art self-supervised models, our method demonstrates superior performance and robustness across various datasets and benchmarks. Notably, it achieves high agreement with human assessments even with limited data and shows high robustness to the nature of data and their pre-processing pipeline. Our contributions offer a streamlined solution for assessment of images while providing insights into the perception of visual information.
