ActiveLab: Active Learning with Re-Labeling by Multiple Annotators
Hui Wen Goh, Jonas Mueller
TL;DR
ActiveLab tackles label-noise in pool-based active learning with multiple annotators by fusing out-of-sample classifier predictions and annotator labels into a per-example acquisition score s_i. It extends CROWDLAB to actively decide between labeling new data and re-labeling existing data, estimating annotator trust via w_j, global P, and model weight w_M, and calibrates predictions with temperature scaling before scoring. The method supports single-model and ensemble configurations, and demonstrates superior performance on both tabular and image tasks with fewer total annotations, including effective active label cleaning. This approach offers a practical, model- and modality-agnostic framework for robust learning under annotation noise, with significant implications for real-world labeling budgets and data quality.
Abstract
In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming to train an accurate classifier by collecting a dataset with the fewest total annotations. Here we propose ActiveLab, a practical method to decide what to label next that works with any classifier model and can be used in pool-based batch active learning with one or multiple annotators. ActiveLab automatically estimates when it is more informative to re-label examples vs. labeling entirely new ones. This is a key aspect of producing high quality labels and trained models within a limited annotation budget. In experiments on image and tabular data, ActiveLab reliably trains more accurate classifiers with far fewer annotations than a wide variety of popular active learning methods.
