Privis: Towards Content-Aware Secure Volumetric Video Delivery
Kaiyuan Hu, Hong Kang, Yili Jin, Junhua Liu, Chengming Hu, Haolun Wu, Xue Liu
TL;DR
The paper addresses secure delivery of volumetric video under tight motion-to-photon latency and diverse privacy needs. It introduces Privis, a saliency-guided transport framework that partitions content into cubes, applies adaptive, per-cube AEAD, and employs traffic shaping to mitigate leakage while meeting MTP latency. Contributions include a formal saliency model $s(c) = \alpha \Phi_{p}(c) + (1-\alpha) \Phi_{s}(c)$, a cube-level protection policy $P(s(c))$, and a transport-layer architecture compatible with existing codecs. Prototype results on the FSVVD dataset show Privis achieves substantial confidentiality gains with only modest latency overhead, validating the feasibility of saliency-conditioned secure volumetric streaming.
Abstract
Volumetric video has emerged as a key paradigm in eXtended Reality (XR) and immersive multimedia because it enables highly interactive, spatially consistent 3D experiences. However, the transport-layer security for such 3D content remains largely unaddressed. Existing volumetric streaming pipelines inherit uniform encryption schemes from 2D video, overlooking the heterogeneous privacy sensitivity of different geometry and the strict motion-to-photon latency constraints of real-time XR. We take an initial step toward content-aware secure volumetric video delivery by introducing Privis, a saliency-guided transport framework that (i) partitions volumetric assets into independent units, (ii) applies lightweight authenticated encryption with adaptive key rotation, and (iii) employs selective traffic shaping to balance confidentiality and low latency. Privis specifies a generalized transport-layer security architecture for volumetric media, defining core abstractions and adaptive protection mechanisms. We further explore a prototype implementation and present initial latency measurements to illustrate feasibility and design tradeoffs, providing early empirical guidance toward future work on real-time, saliency-conditioned secure delivery.
