Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming

Amritha Premkumar; Prajit T Rajendran; Vignesh V Menon; Adam Wieckowski; Benjamin Bross; Detlev Marpe

Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming

Amritha Premkumar, Prajit T Rajendran, Vignesh V Menon, Adam Wieckowski, Benjamin Bross, Detlev Marpe

TL;DR

QADRA presents a latency-aware, quality-driven framework for adaptive video streaming that predicts $XPSNR$ for VVC-coded representations from spatiotemporal features and encoding settings using $XGBoost$, then optimizes per-segment encoding resolution and $QP$ under encoding/decoding time constraints. It introduces a convex-hull online resolution selection, a two-model $QP$ predictor with linear bitrate mapping, and a JND-based representation elimination to prune the bitrate ladder, all implemented in an open-source Python framework. Empirical results show that QADRA can achieve higher RD performance (higher $XPSNR$ at a given bitrate) than standard ladders while exposing the trade-off between quality and encoding/decoding time, illustrating practical gains for energy-efficient, low-latency streaming. The approach enables scalable, content-adaptive bitrate ladders that adapt to device and network diversity, with potential extensions to VR/AR and distributed encoding workflows.

Abstract

Traditional per-title encoding schemes aim to optimize encoding resolutions to deliver the highest perceptual quality for each representation. XPSNR is observed to correlate better with the subjective quality of VVC-coded bitstreams. Towards this realization, we predict the average XPSNR of VVC-coded bitstreams using spatiotemporal complexity features of the video and the target encoding configuration using an XGBoost-based model. Based on the predicted XPSNR scores, we introduce a Quality-A ware Dynamic Resolution Adaptation (QADRA) framework for adaptive video streaming applications, where we determine the convex-hull online. Furthermore, keeping the encoding and decoding times within an acceptable threshold is mandatory for smooth and energy-efficient streaming. Hence, QADRA determines the encoding resolution and quantization parameter (QP) for each target bitrate by maximizing XPSNR while constraining the maximum encoding and/ or decoding time below a threshold. QADRA implements a JND-based representation elimination algorithm to remove perceptually redundant representations from the bitrate ladder. QADRA is an open-source Python-based framework published under the GNU GPLv3 license. Github: https://github.com/PhoenixVideo/QADRA Online documentation: https://phoenixvideo.github.io/QADRA/

Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming

TL;DR

QADRA presents a latency-aware, quality-driven framework for adaptive video streaming that predicts

for VVC-coded representations from spatiotemporal features and encoding settings using

, then optimizes per-segment encoding resolution and

under encoding/decoding time constraints. It introduces a convex-hull online resolution selection, a two-model

predictor with linear bitrate mapping, and a JND-based representation elimination to prune the bitrate ladder, all implemented in an open-source Python framework. Empirical results show that QADRA can achieve higher RD performance (higher

at a given bitrate) than standard ladders while exposing the trade-off between quality and encoding/decoding time, illustrating practical gains for energy-efficient, low-latency streaming. The approach enables scalable, content-adaptive bitrate ladders that adapt to device and network diversity, with potential extensions to VR/AR and distributed encoding workflows.

Abstract

Paper Structure (12 sections, 4 equations, 4 figures, 2 tables, 1 algorithm)

This paper contains 12 sections, 4 equations, 4 figures, 2 tables, 1 algorithm.

Introduction
Related work
Quality-aware dynamic resolution adaptation (QADRA)
Spatiotemporal complexity feature extraction
Optimized resolution prediction
Optimized QP prediction
JND-based representation elimination
Commandline options
Prediction models
Scalability and adaptability
Evaluation results
Conclusions and future directions

Figures (4)

Figure 1: Rate-distortion (RD) and rate-encoding time curves of representative sequences (segments) of Inter-4K dataset stergiou_adapool_2023 encoded at 360p, 720p, 1080p and 2160p resolutions using VVenC encoder wieckowski_vvenc_2021 at faster preset. Here, XPSNR helmrich_xpsnr_2020 is used as the quality metric.
Figure 2: Encoding using QADRA framework envisioned in this paper.
Figure 3: Selected encoding resolutions of representative video sequences (segments) using QADRA ($r_{\text{max}}=2160$).
Figure 4: RD curves, and encoding times of representative video sequences (segments) using QADRA ($r_{\text{max}}=2160$).

Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming

TL;DR

Abstract

Quality-Aware Dynamic Resolution Adaptation Framework for Adaptive Video Streaming

Authors

TL;DR

Abstract

Table of Contents

Figures (4)