H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields
Minyoung Park, Mirae Do, YeonJae Shin, Jaeseok Yoo, Jongkwang Hong, Joongrock Kim, Chul Lee
TL;DR
Indoor 3D reconstruction struggles to jointly capture smooth room layouts and intricate object surfaces. The authors propose H2O-SDF, a two-phase approach comprising Holistic Surface Learning for global geometry and Object Surface Field (OSF) for object-specific details, augmented by normal-uncertainty based loss reweighting and OSF-guided sampling. The OSF introduces a 3D cue that aligns object surfaces with the SDF without direct SDF supervision, addressing vanishing gradient issues and enabling high-frequency detail recovery via losses $\mathcal{L}_{2d_{osf}}$, $\mathcal{L}_{3d_{osf}}$, and $\mathcal{L}_{ref}$. Extensive ablations and ScanNet evaluations show state-of-the-art geometry quality and improved object-detail fidelity, with robust normal predictions. The work advances practical indoor scene reconstruction and opens avenues for scene editing using the OSF signal as a 3D geometric prior.
Abstract
Advanced techniques using Neural Radiance Fields (NeRF), Signed Distance Fields (SDF), and Occupancy Fields have recently emerged as solutions for 3D indoor scene reconstruction. We introduce a novel two-phase learning approach, H2O-SDF, that discriminates between object and non-object regions within indoor environments. This method achieves a nuanced balance, carefully preserving the geometric integrity of room layouts while also capturing intricate surface details of specific objects. A cornerstone of our two-phase learning framework is the introduction of the Object Surface Field (OSF), a novel concept designed to mitigate the persistent vanishing gradient problem that has previously hindered the capture of high-frequency details in other methods. Our proposed approach is validated through several experiments that include ablation studies.
