F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting
Xiangyu Sun, Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Usman Ali, Eunbyung Park
TL;DR
This work tackles the storage bottleneck of 3D Gaussian Splatting (3DGS) used in fast neural rendering by introducing Factorized 3D Gaussian Splatting (F-3DGS). It employs two factorization schemes, canonical polyadic (CP) and vector-matrix (VM), to compress coordinates and per-Gaussian attributes, complemented by trainable binary masking to prune non-contributing Gaussians. The approach achieves substantial storage reductions (often >90%) while preserving image quality and enabling real-time rendering across multiple datasets, including synthetic-NeRF, Tanks&Temples, and Mip-NeRF360. By avoiding fixed grids and leveraging factorized representations, F-3DGS generalizes to large or unbounded scenes with scalable performance, marking a practical advancement for resource-constrained neural rendering applications.
Abstract
The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering speeds while maintaining excellent image quality. However, as it represents objects and scenes using a myriad of Gaussians, it requires substantial storage to achieve high-quality representation. To mitigate the storage overhead, we propose Factorized 3D Gaussian Splatting (F-3DGS), a novel approach that drastically reduces storage requirements while preserving image quality. Inspired by classical matrix and tensor factorization techniques, our method represents and approximates dense clusters of Gaussians with significantly fewer Gaussians through efficient factorization. We aim to efficiently represent dense 3D Gaussians by approximating them with a limited amount of information for each axis and their combinations. This method allows us to encode a substantially large number of Gaussians along with their essential attributes -- such as color, scale, and rotation -- necessary for rendering using a relatively small number of elements. Extensive experimental results demonstrate that F-3DGS achieves a significant reduction in storage costs while maintaining comparable quality in rendered images.
