HDRSDR-VQA: A Subjective Video Quality Dataset for HDR and SDR Comparative Evaluation
Bowen Chen, Cheng-han Lee, Yixu Chen, Zaixi Shang, Hai Wei, Alan C. Bovik
TL;DR
HDRSDR-VQA addresses the need for large-scale, content-level HDR vs SDR VQA by introducing 960 videos from 54 source sequences, rendered in HDR10 and SDR across nine quality levels, and evaluated through a 145-participant subjective study on six HDR-capable TVs. Using pairwise comparisons and ASAP active sampling, the study yields $JOD$ scores via pwcmp, enabling fine-grainedHDR/SDR comparisons under realistic viewing. The results show HDR provides perceptual advantages for content with rich textures and high brightness/color gamut, but these advantages diminish or reverse in low-dynamic or motion-heavy scenes, highlighting the content-dependence of HDR benefits. The dataset offers a valuable resource for benchmarking objective metrics and guiding perceptual optimization in HDR encoding and adaptive streaming systems.
Abstract
We introduce HDRSDR-VQA, a large-scale video quality assessment dataset designed to facilitate comparative analysis between High Dynamic Range (HDR) and Standard Dynamic Range (SDR) content under realistic viewing conditions. The dataset comprises 960 videos generated from 54 diverse source sequences, each presented in both HDR and SDR formats across nine distortion levels. To obtain reliable perceptual quality scores, we conducted a comprehensive subjective study involving 145 participants and six consumer-grade HDR-capable televisions. A total of over 22,000 pairwise comparisons were collected and scaled into Just-Objectionable-Difference (JOD) scores. Unlike prior datasets that focus on a single dynamic range format or use limited evaluation protocols, HDRSDR-VQA enables direct content-level comparison between HDR and SDR versions, supporting detailed investigations into when and why one format is preferred over the other. The open-sourced part of the dataset is publicly available to support further research in video quality assessment, content-adaptive streaming, and perceptual model development.
