Set a Thief to Catch a Thief: Combating Label Noise through Noisy Meta Learning
Hanxuan Wang, Na Lu, Xueying Zhao, Yuxuan Yan, Kaipeng Ma, Kwoh Chee Keong, Gustavo Carneiro
TL;DR
This paper addresses learning from noisy labels by removing the need for extra clean validation data. It introduces STCT, a noisy meta label correction framework that uses a randomly sampled noisy validation set drawn from the training data to guide label correction via an alternating between noisy meta correction and semi-supervised representation learning. The approach is theoretically motivated by consistency results between optimal classifiers on noisy and clean distributions and is operationalized through a non-nested optimization that decouples representation learning from label correction. Extensive experiments on CIFAR-10/100 under symmetric and asymmetric noise, as well as real-world datasets like CIFAR-10N, CIFAR-100N, and Clothing-1M, demonstrate superior label correction and classification performance, particularly at high noise rates. STCT thus offers a practical, annotation-light alternative to traditional meta-learning label correction methods with strong empirical upside and potential broad impact for robust learning in noisy-label regimes.
Abstract
Learning from noisy labels (LNL) aims to train high-performance deep models using noisy datasets. Meta learning based label correction methods have demonstrated remarkable performance in LNL by designing various meta label rectification tasks. However, extra clean validation set is a prerequisite for these methods to perform label correction, requiring extra labor and greatly limiting their practicality. To tackle this issue, we propose a novel noisy meta label correction framework STCT, which counterintuitively uses noisy data to correct label noise, borrowing the spirit in the saying ``Set a Thief to Catch a Thief''. The core idea of STCT is to leverage noisy data which is i.i.d. with the training data as a validation set to evaluate model performance and perform label correction in a meta learning framework, eliminating the need for extra clean data. By decoupling the complex bi-level optimization in meta learning into representation learning and label correction, STCT is solved through an alternating training strategy between noisy meta correction and semi-supervised representation learning. Extensive experiments on synthetic and real-world datasets demonstrate the outstanding performance of STCT, particularly in high noise rate scenarios. STCT achieves 96.9% label correction and 95.2% classification performance on CIFAR-10 with 80% symmetric noise, significantly surpassing the current state-of-the-art.
