MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes
Kehua Chen, Tianlu Mao, Zhuxin Ma, Hao Jiang, Zehao Li, Zihan Liu, Shuqi Gao, Honglong Zhao, Feng Dai, Yucheng Zhang, Zhaoqi Wang
TL;DR
MetroGS tackles the challenge of geometrically faithful, large-scale scene reconstruction by combining a distributed 2D Gaussian Splatting backbone with a structured dense initialization, sparsity-aware densification, a progressive hybrid geometric refinement, and a depth-guided appearance model. The approach unifies geometry and appearance through a depth-consistent feature representation and per-image embeddings, enabling robust reconstruction under lighting and data variability. Key contributions include pointmap-assisted initialization, sparsity compensation densification, PatchMatch-based multi-view refinement, and depth-guided appearance modeling with Tri-Mip features, all trained in a scalable, GPU-friendly framework. Experiments on GauU-Scene and MatrixCity demonstrate superior geometric accuracy, rendering quality, and training efficiency compared with state-of-the-art baselines, highlighting practical impact for large-scale urban reconstruction.
Abstract
Recently, 3D Gaussian Splatting and its derivatives have achieved significant breakthroughs in large-scale scene reconstruction. However, how to efficiently and stably achieve high-quality geometric fidelity remains a core challenge. To address this issue, we introduce MetroGS, a novel Gaussian Splatting framework for efficient and robust reconstruction in complex urban environments. Our method is built upon a distributed 2D Gaussian Splatting representation as the core foundation, serving as a unified backbone for subsequent modules. To handle potential sparse regions in complex scenes, we propose a structured dense enhancement scheme that utilizes SfM priors and a pointmap model to achieve a denser initialization, while incorporating a sparsity compensation mechanism to improve reconstruction completeness. Furthermore, we design a progressive hybrid geometric optimization strategy that organically integrates monocular and multi-view optimization to achieve efficient and accurate geometric refinement. Finally, to address the appearance inconsistency commonly observed in large-scale scenes, we introduce a depth-guided appearance modeling approach that learns spatial features with 3D consistency, facilitating effective decoupling between geometry and appearance and further enhancing reconstruction stability. Experiments on large-scale urban datasets demonstrate that MetroGS achieves superior geometric accuracy, rendering quality, offering a unified solution for high-fidelity large-scale scene reconstruction.
