Early Stopping Against Label Noise Without Validation Data

Suqin Yuan; Lei Feng; Tongliang Liu

Early Stopping Against Label Noise Without Validation Data

Suqin Yuan, Lei Feng, Tongliang Liu

TL;DR

This work tackles the problem of early stopping under label noise without relying on validation data. It proposes Label Wave, a method that uses the prediction changes (PC) on the training set and its moving-average smoothed version to identify the first local minimum as the early stopping point, thereby preventing overfitting to mislabeled samples. The authors formalize stability and variability metrics, reveal a transitional phase called learning confusing patterns, and validate the approach across diverse datasets, architectures, and noise types, showing improvements over traditional hold-out validation and enhancements to existing noisy-label methods. The method offers a practical, data-efficient solution with strong empirical support and suggests further exploration of learning dynamics in noisy settings.

Abstract

Early stopping methods in deep learning face the challenge of balancing the volume of training and validation data, especially in the presence of label noise. Concretely, sparing more data for validation from training data would limit the performance of the learned model, yet insufficient validation data could result in a sub-optimal selection of the desired model. In this paper, we propose a novel early stopping method called Label Wave, which does not require validation data for selecting the desired model in the presence of label noise. It works by tracking the changes in the model's predictions on the training set during the training process, aiming to halt training before the model unduly fits mislabeled data. This method is empirically supported by our observation that minimum fluctuations in predictions typically occur at the training epoch before the model excessively fits mislabeled data. Through extensive experiments, we show both the effectiveness of the Label Wave method across various settings and its capability to enhance the performance of existing methods for learning with noisy labels.

Early Stopping Against Label Noise Without Validation Data

TL;DR

Abstract

Early Stopping Against Label Noise Without Validation Data

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)