A Survey of Label-noise Representation Learning: Past, Present and Future
Bo Han, Quanming Yao, Tongliang Liu, Gang Niu, Ivor W. Tsang, James T. Kwok, Masashi Sugiyama
TL;DR
The survey formalizes Label-noise Representation Learning (LNRL) and analyzes why noisy labels degrade deep models from data, objective, and optimization perspectives. It offers a unified taxonomy distinguishing data-driven noise modeling, loss/regularization design, and memorization-based optimization strategies, and it reviews representative methods across the three axes (e.g., noise-transition layers, forward/backward correction, MentorNet/Co-teaching, Mixup, DivideMix). Key contributions include articulating essential components for robust LNRL, synthesizing theoretical and empirical insights, and outlining future directions such as instance-dependent noise and adversarial LNRL. The work emphasizes datasets, theoretical guarantees, and practical guidelines to advance robust learning in real-world, noisy-label settings across vision, language, and beyond.
Abstract
Classical machine learning implicitly assumes that labels of the training data are sampled from a clean distribution, which can be too restrictive for real-world scenarios. However, statistical-learning-based methods may not train deep learning models robustly with these noisy labels. Therefore, it is urgent to design Label-Noise Representation Learning (LNRL) methods for robustly training deep models with noisy labels. To fully understand LNRL, we conduct a survey study. We first clarify a formal definition for LNRL from the perspective of machine learning. Then, via the lens of learning theory and empirical study, we figure out why noisy labels affect deep models' performance. Based on the theoretical guidance, we categorize different LNRL methods into three directions. Under this unified taxonomy, we provide a thorough discussion of the pros and cons of different categories. More importantly, we summarize the essential components of robust LNRL, which can spark new directions. Lastly, we propose possible research directions within LNRL, such as new datasets, instance-dependent LNRL, and adversarial LNRL. We also envision potential directions beyond LNRL, such as learning with feature-noise, preference-noise, domain-noise, similarity-noise, graph-noise and demonstration-noise.
