Uncertainty Regularized Evidential Regression
Kai Ye, Tiejin Chen, Hua Wei, Liang Zhan
TL;DR
The paper identifies a fundamental weakness in Evidential Regression Networks (ERN): activation constraints that enforce non-negativity can create High Uncertainty Areas (HUA) where gradients vanish and learning stalls. It introduces an uncertainty-regularization term $\mathcal{L}^{U}$ that preserves nonzero gradients within HUA, enabling ERN (and MERN) to learn from the full training set; the approach is validated across cubic regression and monocular depth-estimation tasks, demonstrating improved uncertainty estimation, calibration, and occasionally predictive accuracy, including in out-of-distribution scenarios. The authors extend the regularization to multivariate evidential models with NIW priors, offering a general mechanism to mitigate zero-gradient issues across evidential regression variants. Overall, the work advances the theory and practice of evidential uncertainty learning by providing a robust regularization that enhances learning in previously intractable regions and improves practical performance on real-world tasks.
Abstract
The Evidential Regression Network (ERN) represents a novel approach that integrates deep learning with Dempster-Shafer's theory to predict a target and quantify the associated uncertainty. Guided by the underlying theory, specific activation functions must be employed to enforce non-negative values, which is a constraint that compromises model performance by limiting its ability to learn from all samples. This paper provides a theoretical analysis of this limitation and introduces an improvement to overcome it. Initially, we define the region where the models can't effectively learn from the samples. Following this, we thoroughly analyze the ERN and investigate this constraint. Leveraging the insights from our analysis, we address the limitation by introducing a novel regularization term that empowers the ERN to learn from the whole training set. Our extensive experiments substantiate our theoretical findings and demonstrate the effectiveness of the proposed solution.
