$ν$-DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction
Yunxuan Mao, Bingqi Shen, Yifei Yang, Kai Wang, Rong Xiong, Yiyi Liao, Yue Wang
TL;DR
$ν$-DBA addresses dense driving-scene reconstruction from RGB imagery with noisy trajectories by coupling a neural implicit 3D surface as the map with a geometric dense bundle adjustment backend driven by dense optical flow. The method uses a close-range Signed Distance Function field parameterized by a 3D hash grid and optimizes both surface and camera poses through volume rendering-based measurements, augmented by self-supervised per-scene flow refinement and optional stereo disparity cues. Through extensive experiments on KITTI-360, Waymo, and a Mine truck dataset, the approach achieves superior trajectory accuracy and denser, more detailed reconstructions compared with neural implicit baselines and traditional SLAM, while revealing how geometric cues versus photometric losses trade off in outdoor scenes. The work demonstrates a practical pathway for high-fidelity outdoor depth, surface detail, and novel-view rendering using neural implicit representations within a principled dense BA framework, with potential impact on autonomous driving perception and simulation.
Abstract
The joint optimization of the sensor trajectory and 3D map is a crucial characteristic of bundle adjustment (BA), essential for autonomous driving. This paper presents $ν$-DBA, a novel framework implementing geometric dense bundle adjustment (DBA) using 3D neural implicit surfaces for map parametrization, which optimizes both the map surface and trajectory poses using geometric error guided by dense optical flow prediction. Additionally, we fine-tune the optical flow model with per-scene self-supervision to further improve the quality of the dense mapping. Our experimental results on multiple driving scene datasets demonstrate that our method achieves superior trajectory optimization and dense reconstruction accuracy. We also investigate the influences of photometric error and different neural geometric priors on the performance of surface reconstruction and novel view synthesis. Our method stands as a significant step towards leveraging neural implicit representations in dense bundle adjustment for more accurate trajectories and detailed environmental mapping.
