Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction
Jingui Ma, Yang Hu, Luyang Tang, Jiayu Yang, Yongqi Zhai, Ronggang Wang
TL;DR
3D Gaussian Splatting (3DGS) enables real-time neural rendering but incurs heavy storage costs. The authors introduce a spatial condition-based prediction framework that predicts the anchor feature $f$ from grid-derived context $f_c$ and a learnable residual $f_r$ via FP-Net, supplemented by an instance-aware hyper prior to improve residual entropy estimation. This approach achieves substantial bit-rate reductions, notably 24.42% over the SOTA HAC method and up to 105x reduction compared with vanilla 3DGS, while maintaining rendering quality across five datasets. The work advances practical deployment of 3DGS by combining prediction and structured entropy modeling, and provides a release-ready codebase.
Abstract
Recently, 3D Gaussian Spatting (3DGS) has gained widespread attention in Novel View Synthesis (NVS) due to the remarkable real-time rendering performance. However, the substantial cost of storage and transmission of vanilla 3DGS hinders its further application (hundreds of megabytes or even gigabytes for a single scene). Motivated by the achievements of prediction in video compression, we introduce the prediction technique into the anchor-based Gaussian representation to effectively reduce the bit rate. Specifically, we propose a spatial condition-based prediction module to utilize the grid-captured scene information for prediction, with a residual compensation strategy designed to learn the missing fine-grained information. Besides, to further compress the residual, we propose an instance-aware hyper prior, developing a structure-aware and instance-aware entropy model. Extensive experiments demonstrate the effectiveness of our prediction-based compression framework and each technical component. Even compared with SOTA compression method, our framework still achieves a bit rate savings of 24.42 percent. Code is to be released!
