OmniSDF: Scene Reconstruction using Omnidirectional Signed Distance Functions and Adaptive Binoctrees
Hakyeong Kim, Andreas Meuleman, Hyeonjoong Jang, James Tompkin, Min H. Kim
TL;DR
OmniSDF delivers a neural SDF reconstruction framework tailored for short-baseline omnidirectional video by employing an adaptive spherical binoctree (sphoxels) to concentrate sampling near surfaces and manage memory. It starts from a depth-guided binoctree initialization and iteratively refines sampling through coarse-to-fine subdivisions, using a BFS-based traversal to handle irregular sphoxel shapes. The approach achieves detailed large-scale scene geometry with significantly fewer voxels than Cartesian grids and shows competitive or superior accuracy compared with traditional and neural baselines on synthetic and real omnidirectional data. This yields practical benefits for real-time-like scene understanding in indoor/outdoor settings, with publicly available code supporting research replication.
Abstract
We present a method to reconstruct indoor and outdoor static scene geometry and appearance from an omnidirectional video moving in a small circular sweep. This setting is challenging because of the small baseline and large depth ranges, making it difficult to find ray crossings. To better constrain the optimization, we estimate geometry as a signed distance field within a spherical binoctree data structure and use a complementary efficient tree traversal strategy based on a breadth-first search for sampling. Unlike regular grids or trees, the shape of this structure well-matches the camera setting, creating a better memory-quality trade-off. From an initial depth estimate, the binoctree is adaptively subdivided throughout the optimization; previous methods use a fixed depth that leaves the scene undersampled. In comparison with three neural optimization methods and two non-neural methods, ours shows decreased geometry error on average, especially in a detailed scene, while significantly reducing the required number of voxels to represent such details.
