uSF: Learning Neural Semantic Field with Uncertainty
Vsevolod Skorokhodov, Darya Drozdova, Dmitry Yudin
TL;DR
This work tackles the lack of confidence estimates in NeRF-based 3D scene reconstruction by introducing uSF, a neural field that jointly learns color, semantic labels, and their aleatoric uncertainties. The method adds separate uncertainty heads for RGB and semantic predictions and uses a reparameterization-based Monte Carlo approach to estimate semantic probabilities, all while employing a hash-based positional encoding for efficiency. The learning objective combines color accuracy, semantic classification, and uncertainty regularization via a weighted sum $L = \omega L_{rgb} + \lambda L_{semantic} + (1 - \omega) L_{uncert}$, enabling robust performance with limited training data. Experiments on the Replica dataset show that incorporating uncertainty can improve semantic reconstruction quality and that hash encoding significantly speeds up training compared to baseline NeRF variants, highlighting practical benefits for real-world 3D understanding tasks.
Abstract
Recently, there has been an increased interest in NeRF methods which reconstruct differentiable representation of three-dimensional scenes. One of the main limitations of such methods is their inability to assess the confidence of the model in its predictions. In this paper, we propose a new neural network model for the formation of extended vector representations, called uSF, which allows the model to predict not only color and semantic label of each point, but also estimate the corresponding values of uncertainty. We show that with a small number of images available for training, a model quantifying uncertainty performs better than a model without such functionality. Code of the uSF approach is publicly available at https://github.com/sevashasla/usf/.
