GeoGS3D: Single-view 3D Reconstruction via Geometric-aware Diffusion Model and Gaussian Splatting
Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang
TL;DR
<3-5 sentence high-level summary> GeoGS3D tackles single-view 3D reconstruction by first generating geometry-aware, multi-view images from a single input using an orthogonal-plane decomposition within a diffusion framework. It then reconstructs a high-fidelity 3D Gaussian representation by fusing the views with epipolar attention and accelerating optimization via Gaussian Divergent Significance (GDS). The approach demonstrates strong multi-view consistency and high-quality 3D geometry on Objaverse and Google Scanned Object datasets, outperforming state-of-the-art image-to-3D baselines. This two-stage framework effectively leverages pre-trained 2D diffusion models to enable detailed 3D reconstruction with improved efficiency and geometric fidelity.</paper_summary>
Abstract
We introduce GeoGS3D, a novel two-stage framework for reconstructing detailed 3D objects from single-view images. Inspired by the success of pre-trained 2D diffusion models, our method incorporates an orthogonal plane decomposition mechanism to extract 3D geometric features from the 2D input, facilitating the generation of multi-view consistent images. During the following Gaussian Splatting, these images are fused with epipolar attention, fully utilizing the geometric correlations across views. Moreover, we propose a novel metric, Gaussian Divergence Significance (GDS), to prune unnecessary operations during optimization, significantly accelerating the reconstruction process. Extensive experiments demonstrate that GeoGS3D generates images with high consistency across views and reconstructs high-quality 3D objects, both qualitatively and quantitatively.
