UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment
Vlad Hosu, Lorenzo Agnolucci, Oliver Wiedemann, Daisuke Iso, Dietmar Saupe
TL;DR
The paper presents the UHD-IQA Benchmark Database, a large-scale no-reference IQA dataset consisting of 6073 UHD-1 (4K) images annotated at a fixed width of 3840, with high-quality expert crowdsourced ratings and rich metadata. It tackles the lack of high-resolution, high-quality IQA data by combining expert recruitment, reliability-focused crowdsourcing, and a careful sampling pipeline that filters out synthetic images from Pixabay. The dataset structure includes 61 batches and two annotation rounds to obtain 20 ratings per image, enabling robust MOS estimation and reliable evaluation protocols. Evaluation of multiple NR-IQA methods demonstrates that self-supervised and CLIP-based approaches achieve strong correlations on UHD data but still face challenges in RMSE/MAE, underscoring the need for UHD-specific training and models; the dataset thus provides a valuable resource for advancing practical, high-resolution NR-IQA and cross-resolution generalization.
Abstract
We introduce a novel Image Quality Assessment (IQA) dataset comprising 6073 UHD-1 (4K) images, annotated at a fixed width of 3840 pixels. Contrary to existing No-Reference (NR) IQA datasets, ours focuses on highly aesthetic photos of high technical quality, filling a gap in the literature. The images, carefully curated to exclude synthetic content, are sufficiently diverse to train general NR-IQA models. Importantly, the dataset is annotated with perceptual quality ratings obtained through a crowdsourcing study. Ten expert raters, comprising photographers and graphics artists, assessed each image at least twice in multiple sessions spanning several days, resulting in 20 highly reliable ratings per image. Annotators were rigorously selected based on several metrics, including self-consistency, to ensure their reliability. The dataset includes rich metadata with user and machine-generated tags from over 5,000 categories and popularity indicators such as favorites, likes, downloads, and views. With its unique characteristics, such as its focus on high-quality images, reliable crowdsourced annotations, and high annotation resolution, our dataset opens up new opportunities for advancing perceptual image quality assessment research and developing practical NR-IQA models that apply to modern photos. Our dataset is available at https://database.mmsp-kn.de/uhd-iqa-benchmark-database.html
