RAVQ-HoloNet: Rate-Adaptive Vector-Quantized Hologram Compression
Shima Rafiei, Zahra Nabizadeh Shahr Babak, Shadrokh Samavi, Shahram Shirani
TL;DR
RAVQ-HoloNet tackles the data-telemetry bottleneck in holographic AR/VR by introducing a rate-adaptive, vector-quantized compression framework that unifies complex-valued hologram encoding with phase-only reconstruction. The method leverages a hierarchical VQ latent space with EMA codebooks and a Seq2seq bitrate tuner to enable continuous bitrate control within a single model, avoiding multiple-model maintenance. A two-stage training regime couples a physics-informed ASM-based reconstruction loss with traditional VQ losses, yielding competitive rate–distortion performance; on 2D holograms, it achieves up to -33.91% BD-Rate improvement and $1.02$ dB BD-PSNR over state-of-the-art DPRC at ultra-low bitrates. The approach shows clear practical impact for bandwidth- and storage-constrained devices, and extends naturally to RGB-D and 3D holographic representations, as well as gaze-driven compression strategies for perceptual prioritization.
Abstract
Holography offers significant potential for AR/VR applications, yet its adoption is limited by the high demands of data compression. Existing deep learning approaches generally lack rate adaptivity within a single network. We present RAVQ-HoloNet, a rate-adaptive vector quantization framework that achieves high-fidelity reconstructions at low and ultra-low bit rates, outperforming current state-of-the-art methods. In low bit, our method exceeds by -33.91% in BD-Rate and achieves a BD-PSNR of 1.02 dB from the best existing method demonstrated by the rate-distortion curve.
