AI-Driven Innovations in Volumetric Video Streaming: A Review
Erfan Entezami, Hui Guan
TL;DR
This paper reviews AI-driven approaches to volumetric video streaming, focusing on how to efficiently transmit and render 6-DoF content represented as point clouds, neural radiance fields (NeRF), or 3D Gaussian splatting (3DGS). It proposes a taxonomy that distinguishes explicit from implicit and learnable from fixed representations, and surveys state-of-the-art techniques for each representation: viewport- and quality-based strategies for point clouds; time-aware, deformation-based, and grid-based NeRF methods with rendering accelerations; and motion-tracking and deformation-based extensions for 3DGS. Key contributions include a synthesis of open challenges and proposed future directions such as robust motion handling, edge-device acceleration, and scalable long-sequence streaming. The insights are relevant to researchers and practitioners aiming to deploy volumetric streaming in immersive applications and future networks.
Abstract
Recent efforts to enhance immersive and interactive user experiences have driven the development of volumetric video, a form of 3D content that enables viewing with six degrees of freedom (6-DoF). Unlike traditional 2D content, volumetric content can be represented in various ways, such as point clouds, meshes, or neural representations. However, because of its complex structure and large data size, deploying this new form of 3D data presents significant challenges in transmission and rendering. These challenges have hindered the widespread adoption of volumetric video in everyday applications. In recent years, researchers have proposed various AI-driven techniques to address these challenges and improve the efficiency and quality of volumetric content streaming. This paper provides a comprehensive overview of recent advances in AI-driven approaches that facilitate volumetric content streaming. Through this review, we aim to offer insights into the current state of the art and suggest potential future directions for advancing the deployment of volumetric video streaming in real-world applications.
