Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach

Chen-Chen Zong; Sheng-Jun Huang

TL;DR

The paper addresses active open-set annotation and argues that relying on epistemic uncertainty (EU) or aleatoric uncertainty (AU) alone is suboptimal in open-world settings. It introduces Energy-based Active Open-set Annotation (EAOA), which pairs a $(C+1)$-class detector with a $C$-class target classifier and defines energy-based EU and AU measures, supported by a margin-based energy loss and a target-driven adaptive sampling strategy. Experiments on CIFAR-10/100 and Tiny-ImageNet show state-of-the-art test accuracy and query precision with low training overhead, supporting the joint use of EU and AU. The method improves labeling efficiency in real-world open-set scenarios, and the code is released for reproducibility.

Abstract

Active learning (AL), which iteratively queries the most informative examples from a large pool of unlabeled candidates for model training, faces significant challenges in the presence of open-set classes. Existing methods either prioritize query examples likely to belong to known classes, indicating low epistemic uncertainty (EU), or focus on querying those with highly uncertain predictions, reflecting high aleatoric uncertainty (AU). However, they both yield suboptimal performance, as low EU corresponds to limited useful information, and closed-set AU metrics for unknown class examples are less meaningful. In this paper, we propose an Energy-based Active Open-set Annotation (EAOA) framework, which effectively integrates EU and AU to achieve superior performance. EAOA features a $(C+1)$-class detector and a target classifier, incorporating an energy-based EU measure and a margin-based energy loss designed for the detector, alongside an energy-based AU measure for the target classifier. Another crucial component is the target-driven adaptive sampling strategy. It first forms a smaller candidate set with low EU scores to ensure closed-set properties, making AU metrics meaningful. Subsequently, examples with high AU scores are queried to form the final query set, with the candidate set size adjusted adaptively. Extensive experiments show that EAOA achieves state-of-the-art performance while maintaining high query precision and low training overhead. The code is available at https://github.com/chenchenzong/EAOA.
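The two-stage selection described in the abstract can be illustrated with a minimal sketch. It assumes that per-example EU and AU scores have already been computed; how EAOA derives them from energies, and how it adapts the candidate set size between rounds, is not specified here. The function name `two_stage_query` and the parameters `candidate_size` and `budget` are hypothetical and are not taken from the authors' released code.

```python
import numpy as np


def two_stage_query(eu_scores, au_scores, candidate_size, budget):
    """Illustrative two-stage selection (hypothetical helper, not the authors' code).

    Stage 1 keeps the `candidate_size` examples with the lowest EU scores,
    i.e. those most likely to belong to known classes.
    Stage 2 returns the `budget` candidates with the highest AU scores.
    """
    eu_scores = np.asarray(eu_scores, dtype=float)
    au_scores = np.asarray(au_scores, dtype=float)
    if eu_scores.shape != au_scores.shape or eu_scores.ndim != 1:
        raise ValueError("eu_scores and au_scores must be 1-D arrays of equal length")

    n = eu_scores.shape[0]
    budget = min(budget, n)
    # The candidate set must be at least as large as the query budget.
    candidate_size = min(max(candidate_size, budget), n)

    # Stage 1: indices of the lowest-EU examples (ascending sort).
    candidates = np.argsort(eu_scores, kind="stable")[:candidate_size]

    # Stage 2: among the candidates, rank by AU in descending order.
    order = np.argsort(-au_scores[candidates], kind="stable")[:budget]
    return candidates[order]


# Toy pool of five unlabeled examples (made-up scores for illustration only).
eu = [0.1, 0.9, 0.2, 0.8, 0.3]
au = [0.5, 0.99, 0.1, 0.95, 0.7]
selected = two_stage_query(eu, au, candidate_size=3, budget=2)
```

In this toy pool, examples 1 and 3 have the highest AU scores but are filtered out in the first stage because their EU scores are high, which is the behavior the abstract motivates: AU is only compared among examples that are likely to be closed-set.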
