Robust Robotic Exploration and Mapping Using Generative Occupancy Map Synthesis
Lorin Achey, Alec Reed, Brendan Crowe, Bradley Hayes, Christoffer Heckman
TL;DR
The paper tackles robust robotic exploration under occlusions by introducing SceneSense, a diffusion-based generative occupancy predictor that forecasts unseen geometry from partial observations and merges it probabilistically into a running map. It formalizes a dense-occupancy framework with forward and reverse diffusion, and implements an unconditional denoising network that operates alongside OctoMap, occupancy inpainting, and multi-prediction merging to produce coherent, traversable maps. Empirical results on a quadruped platform show substantial improvements in map fidelity and exploration robustness, including notable FID reductions and better traversal in challenging scenarios such as startup holes and narrow hallways. The work demonstrates that SceneSense can function as a drop-in enhancement to existing planning stacks, enabling more consistent and efficient exploration, with practical implications for real-world robotic deployments and future planning-system integration improvements.
Abstract
We present a novel approach for enhancing robotic exploration by using generative occupancy mapping. We implement SceneSense, a diffusion model designed and trained for predicting 3D occupancy maps given partial observations. Our proposed approach probabilistically fuses these predictions into a running occupancy map in real-time, resulting in significant improvements in map quality and traversability. We deploy SceneSense on a quadruped robot and validate its performance with real-world experiments to demonstrate the effectiveness of the model. In these experiments we show that occupancy maps enhanced with SceneSense predictions better estimate the distribution of our fully observed ground truth data ($24.44\%$ FID improvement around the robot and $75.59\%$ improvement at range). We additionally show that integrating SceneSense enhanced maps into our robotic exploration stack as a ``drop-in'' map improvement, utilizing an existing off-the-shelf planner, results in improvements in robustness and traversability time. Finally, we show results of full exploration evaluations with our proposed system in two dissimilar environments and find that locally enhanced maps provide more consistent exploration results than maps constructed only from direct sensor measurements.
