HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes
Zhuopeng Li, Yilin Zhang, Chenming Wu, Jianke Zhu, Liangjun Zhang
TL;DR
HO-Gaussian addresses the limitations of 3D Gaussian Splatting in urban scenes by removing reliance on SfM point initialization and introducing a grid-based volume to guide Gaussian optimization. It introduces Gaussian Position Encoding and Gaussian Directional Encoding to efficiently represent geometry and view-dependent color, plus Neural Warping to ensure multi-camera consistency, together with Point Densitification to fill low-texture and distant regions. The method jointly optimizes Gaussian parameters and grid-volume attributes with a hybrid loss, achieving real-time, photo-realistic novel-view rendering on Waymo and Argoverse without heavy storage demands. Experimental results demonstrate substantial improvements over both NeRF-based urban methods and SfM/LiDAR-reliant 3DGS baselines, with favorable texture and geometry quality and improved efficiency for large-scale urban scenes.
Abstract
The rapid growth of 3D Gaussian Splatting (3DGS) has revolutionized neural rendering, enabling real-time production of high-quality renderings. However, the previous 3DGS-based methods have limitations in urban scenes due to reliance on initial Structure-from-Motion(SfM) points and difficulties in rendering distant, sky and low-texture areas. To overcome these challenges, we propose a hybrid optimization method named HO-Gaussian, which combines a grid-based volume with the 3DGS pipeline. HO-Gaussian eliminates the dependency on SfM point initialization, allowing for rendering of urban scenes, and incorporates the Point Densitification to enhance rendering quality in problematic regions during training. Furthermore, we introduce Gaussian Direction Encoding as an alternative for spherical harmonics in the rendering pipeline, which enables view-dependent color representation. To account for multi-camera systems, we introduce neural warping to enhance object consistency across different cameras. Experimental results on widely used autonomous driving datasets demonstrate that HO-Gaussian achieves photo-realistic rendering in real-time on multi-camera urban datasets.
