Smol-GS: Compact Representations for Abstract 3D Gaussian Splatting
Haishan Wang, Mohammad Hassan Vali, Arno Solin
TL;DR
<3-5 sentence high-level summary>Smol-GS tackles the memory inefficiency of 3D Gaussian Splatting by learning a compact representation that separates geometry from appearance cues. It uses an occupancy-octree to encode coordinates and learns low-dimensional, splat-wise features that are quantized and entropy-coded, with tiny MLP decoders reconstructing view-dependent appearance. A stage-wise training regime with adaptive density control achieves state-of-the-art compression while preserving rendering fidelity, enabling real-time rendering and potential downstream tasks such as navigation. The method demonstrates an order-of-magnitude reduction in storage on standard benchmarks, matching or surpassing prior methods in quality at a fraction of the size, and opens avenues for discrete 3D scene tokens and editing.
Abstract
We present Smol-GS, a novel method for learning compact representations for 3D Gaussian Splatting (3DGS). Our approach learns highly efficient encodings in 3D space that integrate both spatial and semantic information. The model captures the coordinates of the splats through a recursive voxel hierarchy, while splat-wise features store abstracted cues, including color, opacity, transformation, and material properties. This design allows the model to compress 3D scenes by orders of magnitude without loss of flexibility. Smol-GS achieves state-of-the-art compression on standard benchmarks while maintaining high rendering quality. Beyond visual fidelity, the discrete representations could potentially serve as a foundation for downstream tasks such as navigation, planning, and broader 3D scene understanding.
