Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

Chen Zhao; Xuan Wang; Tong Zhang; Saqib Javed; Mathieu Salzmann

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

Chen Zhao, Xuan Wang, Tong Zhang, Saqib Javed, Mathieu Salzmann

TL;DR

This work addresses overfitting in 3D Gaussian Splatting (3DGS) for few-shot novel view synthesis by introducing Self-Ensembling Gaussian Splatting (SE-GS). SE-GS trains a non-perturbed Σ-model alongside a Δ-model whose parameters are perturbed in an uncertainty-guided manner, creating diverse yet reliable supervision from pseudo-views; a photometric regularization leverages this ensemble to improve generalization. Across LLFF, DTU, Mip-NeRF360, and MVImgNet, SE-GS achieves consistent improvements in PSNR, SSIM, and LPIPS under sparse-view conditions, outperforming prior methods with efficient training. The approach offers a practical, self-supervised regularization strategy for radiance-field models, enabling robust NVS with limited input views and broad applicability to 3D scene synthesis tasks.

Abstract

3D Gaussian Splatting (3DGS) has demonstrated remarkable effectiveness in novel view synthesis (NVS). However, 3DGS tends to overfit when trained with sparse views, limiting its generalization to novel viewpoints. In this paper, we address this overfitting issue by introducing Self-Ensembling Gaussian Splatting (SE-GS). We achieve self-ensembling by incorporating an uncertainty-aware perturbation strategy during training. A $\mathbfΔ$-model and a $\mathbfΣ$-model are jointly trained on the available images. The $\mathbfΔ$-model is dynamically perturbed based on rendering uncertainty across training steps, generating diverse perturbed models with negligible computational overhead. Discrepancies between the $\mathbfΣ$-model and these perturbed models are minimized throughout training, forming a robust ensemble of 3DGS models. This ensemble, represented by the $\mathbfΣ$-model, is then used to generate novel-view images during inference. Experimental results on the LLFF, Mip-NeRF360, DTU, and MVImgNet datasets demonstrate that our approach enhances NVS quality under few-shot training conditions, outperforming existing state-of-the-art methods. The code is released at: https://sailor-z.github.io/projects/SEGS.html.

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

TL;DR

Abstract

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)