S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting
Yecong Wan, Mingwen Shao, Yuanshuo Cheng, Wangmeng Zuo
TL;DR
This work tackles recovering high-fidelity 3D scenes from highly constrained inputs by introducing S2Gaussian, a two-stage framework that first builds a low-resolution Gaussian Splatting representation from sparse views and depth, then densifies to high-resolution Gaussians via Gaussian Shuffle Split. The HR Gaussians are refined using super-resolved imagery from both original views and pseudo-views, with a blur-free inconsistency modeling module and a 3D robust optimization strategy to mitigate multi-view inconsistencies and erroneous updates. Key contributions include the Gaussian Shuffle Split for compact, detail-rich initialization, a dedicated inconsistency-aware refinement pipeline, and robust optimization that yields superior geometry and texture quality, achieving state-of-the-art results on Blender, LLFF, and Mip-NeRF 360 under 4x sparse-view super-resolution. This framework advances practical 3D reconstruction in scenarios where both viewpoint sparsity and image blur limit performance, enabling more consistent rendering and finer details in real-world applications.
Abstract
In this paper, we aim ambitiously for a realistic yet challenging problem, namely, how to reconstruct high-quality 3D scenes from sparse low-resolution views that simultaneously suffer from deficient perspectives and clarity. Whereas existing methods only deal with either sparse views or low-resolution observations, they fail to handle such hybrid and complicated scenarios. To this end, we propose a novel Sparse-view Super-resolution 3D Gaussian Splatting framework, dubbed S2Gaussian, that can reconstruct structure-accurate and detail-faithful 3D scenes with only sparse and low-resolution views. The S2Gaussian operates in a two-stage fashion. In the first stage, we initially optimize a low-resolution Gaussian representation with depth regularization and densify it to initialize the high-resolution Gaussians through a tailored Gaussian Shuffle Split operation. In the second stage, we refine the high-resolution Gaussians with the super-resolved images generated from both original sparse views and pseudo-views rendered by the low-resolution Gaussians. In which a customized blur-free inconsistency modeling scheme and a 3D robust optimization strategy are elaborately designed to mitigate multi-view inconsistency and eliminate erroneous updates caused by imperfect supervision. Extensive experiments demonstrate superior results and in particular establishing new state-of-the-art performances with more consistent geometry and finer details.
