J-NeuS: Joint field optimization for Neural Surface reconstruction in urban scenes with limited image overlap
Fusang Wang, Hala Djeghim, Nathan Piasco, Moussab Bennehar, Luis Roldão, Yizhe WU, Fabien Moutarde, Désiré Sidibé, Dzmitry Tsishkou
TL;DR
J-NeuS addresses the challenge of urban surface reconstruction from driving sequences with limited image overlap by jointly optimizing a volumetric NeRF and a Signed Distance Function (SDF) while explicitly modeling cross-representation uncertainty. The method introduces photometric and geometric uncertainty signals to guide Guided Ray Sampling, enabling the volumetric and surface representations to specialize where they perform best. Adaptive relaxation of geometric regularization prevents over-smoothing in uncertain areas, preserving fine structures essential for autonomous driving tasks. Extensive experiments on KITTI-360, Pandaset, Waymo, and nuScenes show that J-NeuS achieves state-of-the-art or near-state-of-the-art 3D reconstruction quality with efficient mesh extraction, demonstrating strong practical impact for urban scene modeling under challenging observation conditions.
Abstract
Reconstructing the surrounding surface geometry from recorded driving sequences poses a significant challenge due to the limited image overlap and complex topology of urban environments. SoTA neural implicit surface reconstruction methods often struggle in such setting, either failing due to small vision overlap or exhibiting suboptimal performance in accurately reconstructing both the surface and fine structures. To address these limitations, we introduce J-NeuS, a novel hybrid implicit surface reconstruction method for large driving sequences with outward facing camera poses. J-NeuS cross-representation uncertainty estimation to tackle ambiguous geometry caused by limited observations. Our method performs joint optimization of two radiance fields in addition to guided sampling achieving accurate reconstruction of large areas along with fine structures in complex urban scenarios. Extensive evaluation on major driving datasets demonstrates the superiority of our approach in reconstructing large driving sequences with limited image overlap, outperforming concurrent SoTA methods.
