Efficient Adaptive Label Refinement for Label Noise Learning
Wenzhen Zhang, Debo Cheng, Guangquan Lu, Bo Zhou, Jiaye Li, Shichao Zhang
TL;DR
This work tackles the problem of deep models memorizing noisy labels by decoupling robust learning from label denoising. It introduces Adaptive Label Refinement (ALR), a two-phase method that first trains with hard labels and then progressively refines targets to soft distributions via temporal ensembling, supplemented by a small entropy regularization to emphasize learning from clean samples. ALR does not require prior knowledge of noise distributions or external clean data and demonstrates state-of-the-art performance on CIFAR-10/100 with artificial noise and real-world noisy datasets (ANIMAL-10N, Clothing1M, WebVision) along with comprehensive ablations. The approach is simple, effective, and broadly applicable, offering a practical solution for robust learning under label noise.
Abstract
Deep neural networks are highly susceptible to overfitting noisy labels, which leads to degraded performance. Existing methods address this issue by employing manually defined criteria, aiming to achieve optimal partitioning in each iteration to avoid fitting noisy labels while thoroughly learning clean samples. However, this often results in overly complex and difficult-to-train models. To address this issue, we decouple the tasks of avoiding fitting incorrect labels and thoroughly learning clean samples and propose a simple yet highly applicable method called Adaptive Label Refinement (ALR). First, inspired by label refurbishment techniques, we update the original hard labels to soft labels using the model's predictions to reduce the risk of fitting incorrect labels. Then, by introducing the entropy loss, we gradually `harden' the high-confidence soft labels, guiding the model to better learn from clean samples. This approach is simple and efficient, requiring no prior knowledge of noise or auxiliary datasets, making it more accessible compared to existing methods. We validate ALR's effectiveness through experiments on benchmark datasets with artificial label noise (CIFAR-10/100) and real-world datasets with inherent noise (ANIMAL-10N, Clothing1M, WebVision). The results show that ALR outperforms state-of-the-art methods.
