Adaptive Negative Evidential Deep Learning for Open-set Semi-supervised Learning
Yang Yu, Danruo Deng, Furui Liu, Yueming Jin, Qi Dou, Guangyong Chen, Pheng-Ann Heng
TL;DR
Open-set SSL faces the challenge of unlabeled data containing novel categories. The paper proposes Adaptive Negative Evidential Deep Learning (ANEDL), which combines an evidential outlier detector with a traditional Softmax classifier and introduces adaptive negative optimization to suppress outlier evidence while preserving inlier learning. It also defines dedicated uncertainty metrics for self-training ($M_{Self-training}$) and inference ($M_{Inference}$), leveraging Fisher information via Dirichlet evidence to guide learning and outlier detection. Empirically, ANEDL achieves state-of-the-art AUROC and lower error rates on CIFAR-10/100, ImageNet-30, and Mini-ImageNet, particularly when the inlier class set is large. The approach enhances scalability and robustness of open-set SSL, enabling better utilization of unlabeled data in real-world settings.
Abstract
Semi-supervised learning (SSL) methods assume that labeled data, unlabeled data and test data are from the same distribution. Open-set semi-supervised learning (Open-set SSL) considers a more practical scenario, where unlabeled data and test data contain new categories (outliers) not observed in labeled data (inliers). Most previous works focused on outlier detection via binary classifiers, which suffer from insufficient scalability and inability to distinguish different types of uncertainty. In this paper, we propose a novel framework, Adaptive Negative Evidential Deep Learning (ANEDL) to tackle these limitations. Concretely, we first introduce evidential deep learning (EDL) as an outlier detector to quantify different types of uncertainty, and design different uncertainty metrics for self-training and inference. Furthermore, we propose a novel adaptive negative optimization strategy, making EDL more tailored to the unlabeled dataset containing both inliers and outliers. As demonstrated empirically, our proposed method outperforms existing state-of-the-art methods across four datasets.
