Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming
Amritha Premkumar, Prajit T Rajendran, Vignesh V Menon, Adam Wieckowski, Benjamin Bross, Detlev Marpe
TL;DR
QADRA presents a latency-aware, quality-driven framework for adaptive video streaming that predicts $XPSNR$ for VVC-coded representations from spatiotemporal features and encoding settings using $XGBoost$, then optimizes per-segment encoding resolution and $QP$ under encoding/decoding time constraints. It introduces a convex-hull online resolution selection, a two-model $QP$ predictor with linear bitrate mapping, and a JND-based representation elimination to prune the bitrate ladder, all implemented in an open-source Python framework. Empirical results show that QADRA can achieve higher RD performance (higher $XPSNR$ at a given bitrate) than standard ladders while exposing the trade-off between quality and encoding/decoding time, illustrating practical gains for energy-efficient, low-latency streaming. The approach enables scalable, content-adaptive bitrate ladders that adapt to device and network diversity, with potential extensions to VR/AR and distributed encoding workflows.
Abstract
Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. XPSNR is observed to correlate better with the subjective quality of VVC-coded bitstreams. Towards this realization, we predict the average XPSNR of VVC-coded bitstreams using spatiotemporal complexity features of the video and the target encoding configuration using an XGBoost-based model. Based on the predicted XPSNR scores, we introduce a Quality-A ware Dynamic Resolution Adaptation (QADRA) framework for adaptive video streaming applications, where we determine the convex-hull online. Furthermore, keeping the encoding and decoding times within an acceptable threshold is mandatory for smooth and energy-efficient streaming. Hence, QADRA determines the encoding resolution and quantization parameter (QP) for each target bitrate by maximizing XPSNR while constraining the maximum encoding and/ or decoding time below a threshold. QADRA implements a JND-based representation elimination algorithm to remove perceptually redundant representations from the bitrate ladder. QADRA is an open-source Python-based framework published under the GNU GPLv3 license. Github: https://github.com/PhoenixVideo/QADRA Online documentation: https://phoenixvideo.github.io/QADRA/
