Active3D: Active High-Fidelity 3D Reconstruction via Hierarchical Uncertainty Quantification
Yan Li, Yingzhao Li, Gim Hee Lee
TL;DR
Active3D addresses high-fidelity active 3D reconstruction by unifying implicit neural fields with explicit Gaussian primitives into a hybrid scene state. It builds a hierarchical uncertainty map that fuses global structural priors, local surface confidence, and temporal consistency to drive an EHIG-based next-best-view planner and a risk-aware trajectory. A keyframe strategy and a viewpoint-space sliding window enable scalable, uncertainty-aware refinement that preserves global–local consistency. Experimental results on Replica and MP3D demonstrate state-of-the-art accuracy, completeness, and rendering quality, highlighting robustness to occlusion and complex geometries in real-world robotic perception tasks.
Abstract
In this paper, we present an active exploration framework for high-fidelity 3D reconstruction that incrementally builds a multi-level uncertainty space and selects next-best-views through an uncertainty-driven motion planner. We introduce a hybrid implicit-explicit representation that fuses neural fields with Gaussian primitives to jointly capture global structural priors and locally observed details. Based on this hybrid state, we derive a hierarchical uncertainty volume that quantifies both implicit global structure quality and explicit local surface confidence. To focus optimization on the most informative regions, we propose an uncertainty-driven keyframe selection strategy that anchors high-entropy viewpoints as sparse attention nodes, coupled with a viewpoint-space sliding window for uncertainty-aware local refinement. The planning module formulates next-best-view selection as an Expected Hybrid Information Gain problem and incorporates a risk-sensitive path planner to ensure efficient and safe exploration. Extensive experiments on challenging benchmarks demonstrate that our approach consistently achieves state-of-the-art accuracy, completeness, and rendering quality, highlighting its effectiveness for real-world active reconstruction and robotic perception tasks.
