Drainage: A Unifying Framework for Addressing Class Uncertainty
Yasser Taha, Grégoire Montavon, Nils Körber
TL;DR
Drainage introduces a drainage node appended to the output layer to explicitly allocate uncertainty, enabling robust handling of noisy and ambiguous labels. The drainage loss incentivizes reallocating probability to the drainage and target classes while suppressing non-targets, with proven monotonicity and convexity properties. Empirically, Drainage yields substantial gains under high noise on CIFAR-10/100 and competitive results on WebVision, Clothing-1M, and ILSVRC-12, while providing qualitative evidence of denoising and more stable decision boundaries. The approach also supports Open Set Recognition by supplying an explicit unknown score, illustrating its versatility for both robust classification and open-set tasks with end-to-end training.
Abstract
Modern deep learning faces significant challenges with noisy labels, class ambiguity, as well as the need to robustly reject out-of-distribution or corrupted samples. In this work, we propose a unified framework based on the concept of a "drainage node'' which we add at the output of the network. The node serves to reallocate probability mass toward uncertainty, while preserving desirable properties such as end-to-end training and differentiability. This mechanism provides a natural escape route for highly ambiguous, anomalous, or noisy samples, particularly relevant for instance-dependent and asymmetric label noise. In systematic experiments involving the addition of varying proportions of instance-dependent noise or asymmetric noise to CIFAR-10/100 labels, our drainage formulation achieves an accuracy increase of up to 9\% over existing approaches in the high-noise regime. Our results on real-world datasets, such as mini-WebVision, mini-ImageNet and Clothing-1M, match or surpass existing state-of-the-art methods. Qualitative analysis reveals a denoising effect, where the drainage neuron consistently absorbs corrupt, mislabeled, or outlier data, leading to more stable decision boundaries. Furthermore, our drainage formulation enables applications well beyond classification, with immediate benefits for web-scale, semi-supervised dataset cleaning, and open-set applications.
