Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning

Muhammad Salman Ali, Maryam Qamar, Sung-Ho Bae, Enzo Tartaglione

TL;DR

The paper tackles the scalability challenge of differentiable 3D Gaussian Splatting (3DGS) by introducing Trimming the Fat, a gradient-informed post-hoc pruning method. Starting from a pre-trained 3DGS, it iteratively prunes Gaussians using opacity and gradient signals, guided by a 3D prior to preserve scene fidelity, and then fine-tunes the remainder. The approach achieves up to 4× pruning on its own and up to ~25× compression when combined with end-to-end compression, while delivering up to 600 FPS and maintaining high rendering quality across diverse benchmarks. Ablation studies show the necessity of a 3D prior, the superiority of iterative pruning over one-shot pruning, and the method’s advantages over competing compression schemes, underscoring its potential for edge devices and real-time applications.

Abstract

In recent times, the utilization of 3D models has gained traction, owing to the capacity for end-to-end training initially offered by Neural Radiance Fields and more recently by 3D Gaussian Splatting (3DGS) models. The latter holds a significant advantage by inherently easing rapid convergence during training and offering extensive editability. However, despite rapid advancements, the literature on the scalability of these models is still in its infancy. In this study, we take initial steps toward addressing this gap, presenting an approach that enables both the memory and computational scalability of such models. Specifically, we propose "Trimming the Fat", a post-hoc gradient-informed iterative pruning technique that eliminates redundant information encoded in the model. Our experimental findings on widely acknowledged benchmarks attest to the effectiveness of our approach, revealing that up to 75% of the Gaussians can be removed while maintaining or even improving upon baseline performance. Our approach achieves around 50$\times$ compression while preserving performance similar to the baseline model, and speeds up rendering to as much as 600 FPS.

Paper Structure

This paper contains 20 sections, 6 equations, 8 figures, 2 tables.

Figures (8)

  • Figure 1: Vanilla 3DGS-30k vs our novel pruning approach applied with an end-to-end compression technique (Niedermayr et al., 2023).
  • Figure 2: Overview of our pruning pipeline. From a pre-trained 3DGS-30k scene, we first iteratively prune it for a fixed number of iterations with subsequent fine-tuning. Then, we conduct further fine-tuning for 20,000 iterations to obtain our final optimized scene.
  • Figure 3: The graph depicts the trade-off between performance and size when utilizing Trimming the Fat (gradient-aware iterative pruning) compared to the 3DGS-30k and 3DGS-7k baselines as well as opacity-based pruning on the Mip-NeRF360, Tanks&Temples, and Deep Blending datasets.
  • Figure 4: Qualitative comparison of the garden scene at various pruning levels ($\gamma_{\text{iter}}$) using Trimming the Fat (gradient-aware iterative pruning). Our proposed method demonstrates substantially higher compression rates compared to both baselines while maintaining similar visual quality.
  • Figure 5: Trade-off between performance and size through iterative pruning and one-shot pruning techniques (a) and in terms of FPS on the Tanks&Temples dataset (b), and opacity distribution before and after pruning for the truck scene (c).
  • ...and 3 more figures