InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction
Xulong Wang, Siyan Dong, Youyi Zheng, Yanchao Yang
TL;DR
InfoNorm introduces mutual information shaping of surface normals to regularize geometry in SDF-based NeRFs for sparse-view indoor reconstruction. By identifying geometrically correlated regions via multimodal semantic and monocular geometric features and enforcing an InfoNCE-style loss on normals, the method provides a robust, plug-in improvement to multiple baselines. It demonstrates consistent gains on ScanNet++ and Replica, with ablations validating the importance of feature fusion and the normal-based MI formulation. The approach offers a practical route to enhance 3D geometry without heavily altering model architectures, at the cost of some training-time overhead that scales with the underlying network. Overall, InfoNorm improves surface quality and sharpness in challenging sparse-view scenarios while maintaining compatibility with diverse NeRF/SDF pipelines.
Abstract
3D surface reconstruction from multi-view images is essential for scene understanding and interaction. However, complex indoor scenes pose challenges such as ambiguity due to limited observations. Recent implicit surface representations, such as Neural Radiance Fields (NeRFs) and signed distance functions (SDFs), employ various geometric priors to resolve the lack of observed information. Nevertheless, their performance heavily depends on the quality of the pre-trained geometry estimation models. To ease such dependence, we propose regularizing the geometric modeling by explicitly encouraging the mutual information among surface normals of highly correlated scene points. In this way, the geometry learning process is modulated by the second-order correlations from noisy (first-order) geometric priors, thus eliminating the bias due to poor generalization. Additionally, we introduce a simple yet effective scheme that utilizes semantic and geometric features to identify correlated points, enhancing their mutual information accordingly. The proposed technique can serve as a plugin for SDF-based neural surface representations. Our experiments demonstrate the effectiveness of the proposed in improving the surface reconstruction quality of major states of the arts. Our code is available at: \url{https://github.com/Muliphein/InfoNorm}.
