Graph-Guided Scene Reconstruction from Images with 3D Gaussian Splatting
Chong Cheng, Gaochao Song, Yiyang Yao, Qinzheng Zhou, Gangjian Zhang, Hao Wang
TL;DR
This work tackles large-scale 3D scene reconstruction from uncalibrated images by introducing GraphGS, which combines spatial priors for rapid structure estimation with a graph-guided 3D Gaussian Splatting optimization. A camera graph, derived from estimated camera relations, provides topology-aware constraints and an adaptive sampling mechanism to prevent overfitting to sparse viewpoints and accelerate convergence. Key contributions include CNNP for selective pairing, Quadrant Filter to prune noisy matches, an octree-based initialization to reduce points, and a graph-based multi-view consistency loss with betweenness-informed sampling. The method achieves state-of-the-art performance on outdoor benchmarks without ground-truth poses, offering a scalable and practical solution for open-scene reconstruction with potential impact on AR/VR and metaverse applications.
Abstract
This paper investigates an open research challenge of reconstructing high-quality, large 3D open scenes from images. It is observed existing methods have various limitations, such as requiring precise camera poses for input and dense viewpoints for supervision. To perform effective and efficient 3D scene reconstruction, we propose a novel graph-guided 3D scene reconstruction framework, GraphGS. Specifically, given a set of images captured by RGB cameras on a scene, we first design a spatial prior-based scene structure estimation method. This is then used to create a camera graph that includes information about the camera topology. Further, we propose to apply the graph-guided multi-view consistency constraint and adaptive sampling strategy to the 3D Gaussian Splatting optimization process. This greatly alleviates the issue of Gaussian points overfitting to specific sparse viewpoints and expedites the 3D reconstruction process. We demonstrate GraphGS achieves high-fidelity 3D reconstruction from images, which presents state-of-the-art performance through quantitative and qualitative evaluation across multiple datasets. Project Page: https://3dagentworld.github.io/graphgs.
