Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Mengyuan Chen; Junyu Gao; Changsheng Xu

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

TL;DR

This paper identifies nonessential components in Evidential Deep Learning (EDL) and proposes Re-EDL, a streamlined variant that preserves the essential projected probability from subjective logic while relaxing the prior-weight rigidity, variance-minimizing empirical risk, and KL-divergence regularization. By treating the prior weight as a tunable hyperparameter and directly optimizing the Dirichlet expectation via the projected probability, Re-EDL achieves improved uncertainty quantification and outperforms baselines across classical, few-shot, video-modality, and noisy settings. The essential insight is that projected probability, coupled with a gentler evidence activation (softplus) and a magnitude-aware class score parameter, better preserves logit information and mitigates overconfidence without extra regularization burdens. Empirical results, including OOD detection metrics like AUPR, confirm the method’s effectiveness and suggest a new baseline for single-forward-pass uncertainty estimation in diverse domains.

Abstract

Evidential Deep Learning (EDL) is an emerging method for uncertainty estimation that provides reliable predictive uncertainty in a single forward pass, attracting significant attention. Grounded in subjective logic, EDL derives Dirichlet concentration parameters from neural networks to construct a Dirichlet probability density function (PDF), modeling the distribution of class probabilities. Despite its success, EDL incorporates several nonessential settings: In model construction, (1) a commonly ignored prior weight parameter is fixed to the number of classes, while its value actually impacts the balance between the proportion of evidence and its magnitude in deriving predictive scores. In model optimization, (2) the empirical risk features a variance-minimizing optimization term that biases the PDF towards a Dirac delta function, potentially exacerbating overconfidence. (3) Additionally, the structural risk typically includes a KL-divergence-minimizing regularization, whose optimization direction extends beyond the intended purpose and contradicts common sense, diminishing the information carried by the evidence magnitude. Therefore, we propose Re-EDL, a simplified yet more effective variant of EDL, by relaxing the nonessential settings and retaining the essential one, namely, the adoption of projected probability from subjective logic. Specifically, Re-EDL treats the prior weight as an adjustable hyperparameter rather than a fixed scalar, and directly optimizes the expectation of the Dirichlet PDF provided by deprecating both the variance-minimizing optimization term and the divergence regularization term. Extensive experiments and state-of-the-art performance validate the effectiveness of our method. The source code is available at https://github.com/MengyuanChen21/Re-EDL.

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

TL;DR

Abstract

Paper Structure (34 sections, 40 equations, 9 figures, 18 tables)

This paper contains 34 sections, 40 equations, 9 figures, 18 tables.

Introduction
Related Work
Preliminary: From Subjective Logic to Evidential Deep Learning
Subjective Logic Theory
Evidential Deep Learning
Re-EDL: Revisit Essential and Nonessential Settings of EDL
Relaxing the Rigid Setting of Fixing Prior Weight in Model Construction
Deprecating the Variance-minimizing Optimization Term in Empirical Risk
Delving into KL-divergence-minimizing Regularization on Non-target Evidence in Structural Risk
What are the Essential Settings of EDL?
Experiments
Experimental Setup
Implementation Details
Classical Setting
Few-shot Setting
...and 19 more sections

Figures (9)

Figure 1: A conceptual framework diagram illustrating the essential and nonessential settings discussed in Section \ref{['redl']}. By relaxing the nonessential settings while retaining the essential ones, the proposed Re-EDL method achieves more superior uncertainty estimation with reduced complexity.
Figure 2: Precision-Recall (PR) curves and Receiver Operating Characteristic (ROC) curves of differentiating OOD data (SVHN) from ID data (CIFAR-10) in the classical setting. The adopted uncertainty measure is uncertainty mass (UM).
Figure 3: Parameter analysis of the hyperparameter $\lambda$, evaluated by averaged AUPR for OOD detection on CIFAR-10.
Figure 4: Precision-Recall (PR) curves of differentiating OOD data (SVHN) from ID data (CIFAR-10).
Figure 5: Receiver Operating Characteristic (ROC) curves of differentiating OOD data (SVHN) from ID data (CIFAR-10).
...and 4 more figures

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

TL;DR

Abstract

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (9)