Relation Modeling and Distillation for Learning with Noisy Labels
Xiaming Che, Junlin Zhang, Zhuang Qi, Xin Qi
TL;DR
This work tackles the challenge of learning with noisy labels, where label corruption degrades representation learning and model robustness. It introduces RMDNet, a plug-and-play framework with two modules: RM, which learns inter-sample relations through self-supervised learning (via a SimSiam-like setup) and constructs a relation graph, and RGRL, which distills this relational knowledge into the task network to calibrate representations of noisy samples. The approach combines a relation-based regularizer with standard cross-entropy loss, controlled by a hyperparameter K, and demonstrates improvements across CIFAR-10/100 under multiple noise types. Key contributions include the design of edge and node matching losses for graph distillation, comprehensive ablations, parameter analyses, and case studies illustrating improved separation of noisy and clean data. The findings suggest that incorporating relation modeling and knowledge distillation yields more robust representations and can be integrated with existing noisy-label methods to enhance performance in real-world settings.
Abstract
Learning with noisy labels has become an effective strategy for enhancing the robustness of models, which enables models to better tolerate inaccurate data. Existing methods either focus on optimizing the loss function to mitigate the interference from noise, or design procedures to detect potential noise and correct errors. However, their effectiveness is often compromised in representation learning due to the dilemma where models overfit to noisy labels. To address this issue, this paper proposes a relation modeling and distillation framework that models inter-sample relationships via self-supervised learning and employs knowledge distillation to enhance understanding of latent associations, which mitigate the impact of noisy labels. Specifically, the proposed method, termed RMDNet, includes two main modules, where the relation modeling (RM) module implements the contrastive learning technique to learn representations of all data, an unsupervised approach that effectively eliminates the interference of noisy tags on feature extraction. The relation-guided representation learning (RGRL) module utilizes inter-sample relation learned from the RM module to calibrate the representation distribution for noisy samples, which is capable of improving the generalization of the model in the inference phase. Notably, the proposed RMDNet is a plug-and-play framework that can integrate multiple methods to its advantage. Extensive experiments were conducted on two datasets, including performance comparison, ablation study, in-depth analysis and case study. The results show that RMDNet can learn discriminative representations for noisy data, which results in superior performance than the existing methods.
