Image-Conditioned 3D Gaussian Splat Quantization
Xinshuang Liu, Runfa Blark Li, Keito Suzuki, Truong Nguyen
TL;DR
This work tackles the challenge of archiving large-scale 3D Gaussian Splatting (3DGS) scenes by introducing ICGS-Quantizer, which jointly quantizes Gaussians and attributes using shared codebooks learned across many scenes. It further enables adaptability to post-archival scene changes by conditioning decoding on one or more current images, using a coarse-to-fine fusion of image features with the latent scene representation. The approach achieves kilobyte-scale storage while maintaining high visual fidelity and delivers strong performance in both 3D scene compression and 3D scene updating, outperforming state-of-the-art methods and approaching upper-bound references with far less storage. The method's joint encoding, image-conditioned decoding, and shared-codebook design offer practical benefits for large collections and storage-constrained deployments, with public code and data anticipated to accelerate adoption.
Abstract
3D Gaussian Splatting (3DGS) has attracted considerable attention for enabling high-quality real-time rendering. Although 3DGS compression methods have been proposed for deployment on storage-constrained devices, two limitations hinder archival use: (1) they compress medium-scale scenes only to the megabyte range, which remains impractical for large-scale scenes or extensive scene collections; and (2) they lack mechanisms to accommodate scene changes after long-term archival. To address these limitations, we propose an Image-Conditioned Gaussian Splat Quantizer (ICGS-Quantizer) that substantially enhances compression efficiency and provides adaptability to scene changes after archiving. ICGS-Quantizer improves quantization efficiency by jointly exploiting inter-Gaussian and inter-attribute correlations and by using shared codebooks across all training scenes, which are then fixed and applied to previously unseen test scenes, eliminating the overhead of per-scene codebooks. This approach effectively reduces the storage requirements for 3DGS to the kilobyte range while preserving visual fidelity. To enable adaptability to post-archival scene changes, ICGS-Quantizer conditions scene decoding on images captured at decoding time. The encoding, quantization, and decoding processes are trained jointly, ensuring that the codes, which are quantized representations of the scene, are effective for conditional decoding. We evaluate ICGS-Quantizer on 3D scene compression and 3D scene updating. Experimental results show that ICGS-Quantizer consistently outperforms state-of-the-art methods in compression efficiency and adaptability to scene changes. Our code, model, and data will be publicly available on GitHub.
