Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments
Jie Deng, Fengtian Lang, Zikang Yuan, Xin Yang
TL;DR
The paper tackles indoor monocular VO by addressing depth-approximation errors from discrete prior maps. It introduces a continuous 3D Gaussian map rendered via differentiable Gaussian splatting to provide depth $d_p$ for every pixel, enabling direct photometric optimization without interpolation. The approach comprises a global map module that renders depth maps from a Gaussian map and a local odometry that uses pixel-depth pairs for pose estimation, with a sliding-window BA to refine keyframe trajectories. Experimental results on two public datasets show improved tracking accuracy and robustness over baselines, and the authors release their code to foster community development.
Abstract
Accurate localization is essential for robotics and augmented reality applications such as autonomous navigation. Vision-based methods combining prior maps aim to integrate LiDAR-level accuracy with camera cost efficiency for robust pose estimation. Existing approaches, however, often depend on unreliable interpolation procedures when associating discrete point cloud maps with dense image pixels, which inevitably introduces depth errors and degrades pose estimation accuracy. We propose a monocular visual odometry framework utilizing a continuous 3D Gaussian map, which directly assigns geometrically consistent depth values to all extracted high-gradient points without interpolation. Evaluations on two public datasets demonstrate superior tracking accuracy compared to existing methods. We have released the source code of this work for the development of the community.
