Partial-Label Learning with a Reject Option
Tobias Fuchs, Florian Kalinke, Klemens Böhm
TL;DR
This work tackles partial-label learning with ambiguously labeled data and the risk of misclassification in safety-critical settings. It introduces Dst-Pll, a nearest-neighbor PLL method that maintains credal sets via Dempster-Shafer theory and uses Yager's rule to fuse evidence from neighbors. A novel adaptive reject option based on belief and plausibility decides accept vs reject, achieving improved trade-offs between the number and accuracy of non-rejected predictions and proving risk consistency. Empirical results on synthetic and real-world datasets show competitive predictive performance and superior rejection behavior across varying noise conditions, with runtime dominated by nearest-neighbor search. Code and data are released to support reproducibility.
Abstract
In real-world applications, one often encounters ambiguously labeled data, where different annotators assign conflicting class labels. Partial-label learning allows training classifiers in this weakly supervised setting, where state-of-the-art methods already show good predictive performance. However, even the best algorithms give incorrect predictions, which can have severe consequences when they impact actions or decisions. We propose a novel risk-consistent nearest-neighbor-based partial-label learning algorithm with a reject option, that is, the algorithm can reject unsure predictions. Extensive experiments on artificial and real-world datasets show that our method provides the best trade-off between the number and accuracy of non-rejected predictions when compared to our competitors, which use confidence thresholds for rejecting unsure predictions. When evaluated without the reject option, our nearest-neighbor-based approach also achieves competitive prediction performance.
