Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
Zhirui Dai, Hojoon Shin, Yulun Tian, Ki Myung Brian Lee, Nikolay Atanasov
TL;DR
This work introduces a scene-level signed directional distance function (SDDF) and a hybrid explicit-implicit model that combines an ellipsoid-based Prior Network with a neural Residual Network to enable fast, differentiable directional distance queries. The prior provides a coarse, differentiable geometric scaffold, while the Latent Feature Network and Residual Decoder refine fine details, ensuring the SDDF satisfies the directional Eikonal equation by construction: $f(p,v)=f(p,v)+\delta_f$. Empirically, the approach is competitive with state-of-the-art neural implicit scene models in reconstruction accuracy and rendering efficiency on Replica, Gibson, and ScanNet datasets, while enabling differentiable viewpoint optimization for active navigation and exploration. This framework offers a practical pathway toward efficient, differentiable scene representations for robotics and automated exploration.
Abstract
Dense geometric environment representations are critical for autonomous mobile robot navigation and exploration. Recent work shows that implicit continuous representations of occupancy, signed distance, or radiance learned using neural networks offer advantages in reconstruction fidelity, efficiency, and differentiability over explicit discrete representations based on meshes, point clouds, and voxels. In this work, we explore a directional formulation of signed distance, called signed directional distance function (SDDF). Unlike signed distance function (SDF) and similar to neural radiance fields (NeRF), SDDF has a position and viewing direction as input. Like SDF and unlike NeRF, SDDF directly provides distance to the observed surface along the direction, rather than integrating along the view ray, allowing efficient view synthesis. To learn and predict scene-level SDDF efficiently, we develop a differentiable hybrid representation that combines explicit ellipsoid priors and implicit neural residuals. This approach allows the model to effectively handle large distance discontinuities around obstacle boundaries while preserving the ability for dense high-fidelity prediction. We show that SDDF is competitive with the state-of-the-art neural implicit scene models in terms of reconstruction accuracy and rendering efficiency, while allowing differentiable view prediction for robot trajectory optimization.
