Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery
Kyle Gao, Dening Lu, Hongjie He, Linlin Xu, Jonathan Li
TL;DR
The paper tackles large-scale urban 3D reconstruction and photorealistic view synthesis from Google Earth imagery. It proposes 3D Gaussian Splatting, with a differentiable rasterizer and per-Gaussian lighting via spherical harmonics, to achieve fast, photorealistic novel-view synthesis and dense geometry densification in remote sensing scenarios. Results on the Waterloo region and BungeeNeRF-style city scenes show that 3DGS can surpass NeRF-based methods in visual quality while offering substantially faster training times, albeit with higher memory demands and some geometric misalignment relative to COLMAP MVS benchmarks. The work highlights the potential of satellite/aerial imagery-driven 3DGS for urban digital twins and GIS applications, while identifying limitations and future directions in memory efficiency, local multi-scale modeling, and semantic 3D reconstruction.
Abstract
3D urban scene reconstruction and modelling is a crucial research area in remote sensing with numerous applications in academia, commerce, industry, and administration. Recent advancements in view synthesis models have facilitated photorealistic 3D reconstruction solely from 2D images. Leveraging Google Earth imagery, we construct a 3D Gaussian Splatting model of the Waterloo region centered on the University of Waterloo and are able to achieve view-synthesis results far exceeding previous 3D view-synthesis results based on neural radiance fields which we demonstrate in our benchmark. Additionally, we retrieved the 3D geometry of the scene using the 3D point cloud extracted from the 3D Gaussian Splatting model which we benchmarked against our Multi- View-Stereo dense reconstruction of the scene, thereby reconstructing both the 3D geometry and photorealistic lighting of the large-scale urban scene through 3D Gaussian Splatting
