LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning
Adithya Raman, Bekir Turkkan, Tevfik Kosar
TL;DR
This paper tackles the energy inefficiency of live video streaming on mobile by redefining QoE through perceptual quality, measured with VMAF, rather than bitrate. It introduces LL-GABR, a deep reinforcement learning agent that jointly optimizes playback bitrate and speed under a lightweight bandwidth predictor and an energy-aware QoE reward, generalizable across per-title encoding ladders without retraining. Using a distributed Soft-Actor-Critic framework with Priority Experience Replay, LL-GABR achieves up to 44% higher perceptual QoE and up to 73% better energy efficiency, while reducing total energy consumption by 11% across trace-driven evaluations. The approach demonstrates the potential for scalable, energy-conscious live ABR in mobile environments, balancing perceptual quality, latency, and energy use in real time.
Abstract
Over the recent years, research and development in adaptive bitrate (ABR) algorithms for live video streaming have been successful in improving users' quality of experience (QoE) by reducing latency to near real-time levels while delivering higher bitrate videos with minimal rebuffering time. However, the QoE models used by these ABR algorithms do not take into account that a large portion of live video streaming clients use mobile devices where a higher bitrate does not necessarily translate into higher perceived quality. Ignoring perceived quality results in playing videos at higher bitrates without a significant increase in perceptual video quality and becomes a burden for battery-constrained mobile devices due to higher energy consumption. In this paper, we propose LL-GABR, a deep reinforcement learning approach that models the QoE using perceived video quality instead of bitrate and uses energy consumption along with other metrics like latency, rebuffering events, and smoothness. LL-GABR makes no assumptions about the underlying video, environment, or network settings and can operate flexibly on different video titles, each having a different bitrate encoding ladder without additional re-training, unlike existing learning-based ABRs. Trace-driven experimental results show that LL-GABR outperforms the state-of-the-art approaches by up to 44% in terms of perceptual QoE and a 73% increase in energy efficiency as a result of reducing net energy consumption by 11%.
