LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting
Zhenyu Bao, Guibiao Liao, Kaichen Zhou, Kanglin Liu, Qing Li, Guoping Qiu
TL;DR
LoopSparseGS addresses sparse-input novel view synthesis for 3D Gaussian Splatting by introducing a loop-based Progressive Gaussian Initialization that densifies initial point clouds using pseudo-views, a Depth-alignment Regularization that fuses sparse SfM depth with dense monocular depth via a sliding-window loss, and a Sparse-friendly Sampling strategy that splits oversized Gaussians guided by pixel error. These components collectively provide denser geometry, more reliable depth supervision, and improved handling of large Gaussians, yielding state-of-the-art performance on four datasets (indoor, outdoor, and object-level) across multiple resolutions. The method demonstrates robust improvements over existing approaches in PSNR, SSIM, and perceptual metrics, while maintaining efficient rendering speeds. This work advances sparse-view NVS by integrating loop-based initialization, depth alignment, and adaptive Gaussian densification to produce photorealistic results with limited input views, enabling more practical deployment in real-world scenarios.
Abstract
Despite the photorealistic novel view synthesis (NVS) performance achieved by the original 3D Gaussian splatting (3DGS), its rendering quality significantly degrades with sparse input views. This performance drop is mainly caused by the limited number of initial points generated from the sparse input, insufficient supervision during the training process, and inadequate regularization of the oversized Gaussian ellipsoids. To handle these issues, we propose the LoopSparseGS, a loop-based 3DGS framework for the sparse novel view synthesis task. In specific, we propose a loop-based Progressive Gaussian Initialization (PGI) strategy that could iteratively densify the initialized point cloud using the rendered pseudo images during the training process. Then, the sparse and reliable depth from the Structure from Motion, and the window-based dense monocular depth are leveraged to provide precise geometric supervision via the proposed Depth-alignment Regularization (DAR). Additionally, we introduce a novel Sparse-friendly Sampling (SFS) strategy to handle oversized Gaussian ellipsoids leading to large pixel errors. Comprehensive experiments on four datasets demonstrate that LoopSparseGS outperforms existing state-of-the-art methods for sparse-input novel view synthesis, across indoor, outdoor, and object-level scenes with various image resolutions.
