SizeGS: Size-aware Compression of 3D Gaussian Splatting via Mixed Integer Programming
Shuzhao Xie, Jiahang Liu, Weixiang Zhang, Shijia Ge, Sicheng Pan, Chen Tang, Yunpeng Bai, Cong Zhang, Xiaoyi Fan, Zhi Wang
TL;DR
SizeGS addresses the large data footprint of 3D Gaussian Splatting by casting size-aware compression as a MINLP and decoupling the optimization into discrete reserve-ratio search and 0-1 ILP for bit-widths. It establishes a linearized size model and a quantization-loss proxy to rapidly approximate quality, enabling fast exploration of hyperparameters to meet a target size. Through CUDA-accelerated quantization and a piecewise finetuning stage, SizeGS delivers state-of-the-art offline compression and competitive results against online methods after finetuning, across multiple 3DGS variants. The approach offers practical benefits for bandwidth-fluctuating scenarios such as volumetric streaming and remote teleoperation by enabling quick, near real-time size-constrained compression.
Abstract
Recent advances in 3D Gaussian Splatting (3DGS) have greatly improved 3D reconstruction. However, its substantial data size poses a significant challenge for transmission and storage. While many compression techniques have been proposed, they fail to efficiently adapt to fluctuating network bandwidth, leading to resource wastage. We address this issue from the perspective of size-aware compression, where we aim to compress 3DGS to a desired size by quickly searching for suitable hyperparameters. Through a measurement study, we identify key hyperparameters that affect the size -- namely, the reserve ratio of Gaussians and bit-width settings for Gaussian attributes. Then, we formulate this hyperparameter optimization problem as a mixed-integer nonlinear programming (MINLP) problem, with the goal of maximizing visual quality while respecting the size budget constraint. To solve the MINLP, we decouple this problem into two parts: discretely sampling the reserve ratio and determining the bit-width settings using integer linear programming (ILP). To solve the ILP more quickly and accurately, we design a quality loss estimator and a calibrated size estimator, as well as implement a CUDA kernel. Extensive experiments on multiple 3DGS variants demonstrate that our method achieves state-of-the-art performance in post-training compression. Furthermore, our method can achieve comparable quality to leading training-required methods after fine-tuning.
