Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics
Nima Sedaghat, Tanawan Chatchadanoraset, Colin Orion Chandler, Ashish Mahabal, Maryam Eslami
TL;DR
This work addresses the challenge of scarce and noisy labels in scientific domains by introducing Selfish Evolution, a method that uses per-sample overfitting dynamics to reveal noise patterns and recover true labels. It constructs evolution cubes $\mathcal{E}_i=\{y_i^t\}_{t=1}^T$ during targeted overfitting and trains a secondary Evolution-to-Label mapper $g(\mathcal{E};\phi)$ to predict corrected labels, optionally looping in a closed-loop to gradually denoise the dataset. The approach is network-state-agnostic and designed to work with image-like labels, demonstrated on supernova detection in DESC/DC2 data where $817$ missed events were recovered, and on MNIST to validate correction under high noise with competitive baselines. The method’s combination of evolution-based label correction and closed-loop refinement offers practical impact for domain-specific labeling bottlenecks and has potential for broader domain adaptation across imaging tasks. All mathematical notation is kept within $...$ delimiters to ensure precise, machine-readable representation of the core concepts.
Abstract
Motivated by the scarcity of proper labels in an astrophysical application, we have developed a novel technique, called Selfish Evolution, which allows for the detection and correction of corrupted labels in a weakly supervised fashion. Unlike methods based on early stopping, we let the model train on the noisy dataset. Only then do we intervene and allow the model to overfit to individual samples. The ``evolution'' of the model during this process reveals patterns with enough information about the noisiness of the label, as well as its correct version. We train a secondary network on these spatiotemporal ``evolution cubes'' to correct potentially corrupted labels. We incorporate the technique in a closed-loop fashion, allowing for automatic convergence towards a mostly clean dataset, without presumptions about the state of the network in which we intervene. We evaluate on the main task of the Supernova-hunting dataset but also demonstrate efficiency on the more standard MNIST dataset.
