Parameter-tuning-free data entry error unlearning with adaptive selective synaptic dampening
Stefan Schoepf, Jack Foster, Alexandra Brintrup
TL;DR
The paper tackles the problem of data-entry label errors degrading model performance and the high cost of full retraining. It introduces Adaptive Selective Synaptic Dampening (ASSD), a parameter-free extension of Selective Synaptic Dampening (SSD) that automatically selects a forgetting aggressiveness using a percentile of relative Fisher Information importances, enabling forgetting of the erroneous data without tuning. The authors validate ASSD on standard unlearning benchmarks with ResNet18 and Vision Transformer, and demonstrate its practical viability on a real-world supply-chain delay dataset with varying label-error rates, where it matches or outperforms fine-tuning and approaches retraining at low error rates. This work enables robust, scalable data-entry error unlearning in industrial settings, reducing computational costs while preserving performance on correctly labeled data ($D_r$).
Abstract
Data entry constitutes a fundamental component of the machine learning pipeline, yet it frequently results in the introduction of labelling errors. When a model has been trained on a dataset containing such errors its performance is reduced. This leads to the challenge of efficiently unlearning the influence of the erroneous data to improve the model performance without needing to completely retrain the model. While model editing methods exist for cases in which the correct label for a wrong entry is known, we focus on the case of data entry errors where we do not know the correct labels for the erroneous data. Our contribution is twofold. First, we introduce an extension to the selective synaptic dampening unlearning method that removes the need for parameter tuning, making unlearning accessible to practitioners. We demonstrate the performance of this extension, adaptive selective synaptic dampening (ASSD), on various ResNet18 and Vision Transformer unlearning tasks. Second, we demonstrate the performance of ASSD in a supply chain delay prediction problem with labelling errors using real-world data where we randomly introduce various levels of labelling errors. The application of this approach is particularly compelling in industrial settings, such as supply chain management, where a significant portion of data entry occurs manually through Excel sheets, rendering it error-prone. ASSD shows strong performance on general unlearning benchmarks and on the error correction problem where it outperforms fine-tuning for error correction.
