On Scaling Up 3D Gaussian Splatting Training

Hexu Zhao; Haoyang Weng; Daohan Lu; Ang Li; Jinyang Li; Aurojit Panda; Saining Xie

On Scaling Up 3D Gaussian Splatting Training

Hexu Zhao, Haoyang Weng, Daohan Lu, Ang Li, Jinyang Li, Aurojit Panda, Saining Xie

TL;DR

This work presents Grendel, a distributed system that scales 3D Gaussian Splatting training across multiple GPUs by partitioning Gaussians and pixel regions and using sparse communication. It introduces an automatic hyperparameter scaling rule based on the independent gradients hypothesis to enable batched training with large batch sizes without sacrificing quality. Empirical results on large-scale, high-resolution scenes show substantial throughput gains and reconstruction quality improvements over single-GPU training and rival methods like CityGaussian. Grendel enables high-detail 3D reconstructions at scales previously impractical with 3DGS and is released as open source.

Abstract

3D Gaussian Splatting (3DGS) is increasingly popular for 3D reconstruction due to its superior visual quality and rendering speed. However, 3DGS training currently occurs on a single GPU, limiting its ability to handle high-resolution and large-scale 3D reconstruction tasks due to memory constraints. We introduce Grendel, a distributed system designed to partition 3DGS parameters and parallelize computation across multiple GPUs. As each Gaussian affects a small, dynamic subset of rendered pixels, Grendel employs sparse all-to-all communication to transfer the necessary Gaussians to pixel partitions and performs dynamic load balancing. Unlike existing 3DGS systems that train using one camera view image at a time, Grendel supports batched training with multiple views. We explore various optimization hyperparameter scaling strategies and find that a simple sqrt(batch size) scaling rule is highly effective. Evaluations using large-scale, high-resolution scenes show that Grendel enhances rendering quality by scaling up 3DGS parameters across multiple GPUs. On the Rubble dataset, we achieve a test PSNR of 27.28 by distributing 40.4 million Gaussians across 16 GPUs, compared to a PSNR of 26.28 using 11.2 million Gaussians on a single GPU. Grendel is an open-source project available at: https://github.com/nyu-systems/Grendel-GS

On Scaling Up 3D Gaussian Splatting Training

TL;DR

Abstract

On Scaling Up 3D Gaussian Splatting Training

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)