SAGS: Structure-Aware 3D Gaussian Splatting
Evangelos Ververas, Rolandos Alexandros Potamias, Jifei Song, Jiankang Deng, Stefanos Zafeiriou
TL;DR
We address the limitations of geometry-agnostic 3D Gaussian Splatting (3D-GS) in novel-view synthesis, which often yields floaters and depth errors. We propose Structure-Aware Gaussian Splatting (SAGS), a pipeline that uses a local-global graph to encode scene geometry, a curvature-aware densification step, a structure-aware encoder, and a refinement network to predict Gaussian attributes, plus a lightweight SAGS-Lite variant with mid-point interpolation for compact representations. Across 13 scenes from Mip-NeRF360, Tanks&Temples, and Deep Blending, SAGS achieves state-of-the-art rendering quality while reducing model storage by up to 11.7× for the full model and up to 24× for the lite variant, all while maintaining real-time rendering. Our results demonstrate that enforcing structure preserves scene topology and depth, reduces artifacts, and enables efficient, high-fidelity neural rendering suitable for VR/AR applications.
Abstract
Following the advent of NeRFs, 3D Gaussian Splatting (3D-GS) has paved the way to real-time neural rendering overcoming the computational burden of volumetric methods. Following the pioneering work of 3D-GS, several methods have attempted to achieve compressible and high-fidelity performance alternatives. However, by employing a geometry-agnostic optimization scheme, these methods neglect the inherent 3D structure of the scene, thereby restricting the expressivity and the quality of the representation, resulting in various floating points and artifacts. In this work, we propose a structure-aware Gaussian Splatting method (SAGS) that implicitly encodes the geometry of the scene, which reflects to state-of-the-art rendering performance and reduced storage requirements on benchmark novel-view synthesis datasets. SAGS is founded on a local-global graph representation that facilitates the learning of complex scenes and enforces meaningful point displacements that preserve the scene's geometry. Additionally, we introduce a lightweight version of SAGS, using a simple yet effective mid-point interpolation scheme, which showcases a compact representation of the scene with up to 24$\times$ size reduction without the reliance on any compression strategies. Extensive experiments across multiple benchmark datasets demonstrate the superiority of SAGS compared to state-of-the-art 3D-GS methods under both rendering quality and model size. Besides, we demonstrate that our structure-aware method can effectively mitigate floating artifacts and irregular distortions of previous methods while obtaining precise depth maps. Project page https://eververas.github.io/SAGS/.
