Taking Class Imbalance Into Account in Open Set Recognition Evaluation
Joanna Komorniczak, Pawel Ksieniewicz
TL;DR
Open Set Recognition (OSR) faces evaluation challenges under real-world imbalances between known (kkc) and unknown (uuc) classes. The authors analyze how standard metrics and evaluation protocols distort OSR performance as Openness and $kkc$/$uuc$ distributions vary, and propose extending the evaluation with four metrics—Inner, Outer, Halfpoint, and Overall—derived from Balanced Accuracy. Through experiments with discriminative and generative baselines on CIFAR10/SVHN and MNIST/Omniglot configurations, they show that metric choice and data imbalance critically influence conclusions, with generative methods often excelling under the Outer/Overall measures while discriminative methods perform better on the Inner score. The paper delivers practical guidelines for robust OSR evaluation in imbalanced, open-world settings, advocating multiple configurations, repetition, and the Halfpoint/Overall metrics to penalize false unknowns and better reflect real-world performance.
Abstract
In recent years Deep Neural Network-based systems are not only increasing in popularity but also receive growing user trust. However, due to the closed-world assumption of such systems, they cannot recognize samples from unknown classes and often induce an incorrect label with high confidence. Presented work looks at the evaluation of methods for Open Set Recognition, focusing on the impact of class imbalance, especially in the dichotomy between known and unknown samples. As an outcome of problem analysis, we present a set of guidelines for evaluation of methods in this field.
