Table of Contents
Fetching ...

Adversarial Mixup Unlearning

Zhuoyi Peng, Yixuan Tang, Yi Yang

TL;DR

This paper tackles catastrophic unlearning, where erasing targeted data from a trained model can inadvertently degrade performance on retained knowledge. It introduces MixUnlearn, a generator–unlearner framework that uses an adversarial mixup generator to produce hard samples from Forgetting and Remaining data, coupled with two contrastive losses to regularize forgetting and retention. The method achieves superior or competitive unlearning performance across class- and data-level tasks on datasets like CIFAR-10, SVHN, MNIST, and Fashion-MNIST, with robustness to noisy and semi-supervised settings and notable efficiency gains via a lightweight MixBlock and periodic generator updates. Extensive analyses—ablations, representation visualizations, KDE loss-distributions, and large-scale ImageNet/Vit experiments—support MixUnlearn as an effective, scalable approach to approximate machine unlearning that mitigates catastrophic forgetting while preserving essential knowledge and generalization.

Abstract

Machine unlearning is a critical area of research aimed at safeguarding data privacy by enabling the removal of sensitive information from machine learning models. One unique challenge in this field is catastrophic unlearning, where erasing specific data from a well-trained model unintentionally removes essential knowledge, causing the model to deviate significantly from a retrained one. To address this, we introduce a novel approach that regularizes the unlearning process by utilizing synthesized mixup samples, which simulate the data susceptible to catastrophic effects. At the core of our approach is a generator-unlearner framework, MixUnlearn, where a generator adversarially produces challenging mixup examples, and the unlearner effectively forgets target information based on these synthesized data. Specifically, we first introduce a novel contrastive objective to train the generator in an adversarial direction: generating examples that prompt the unlearner to reveal information that should be forgotten, while losing essential knowledge. Then the unlearner, guided by two other contrastive loss terms, processes the synthesized and real data jointly to ensure accurate unlearning without losing critical knowledge, overcoming catastrophic effects. Extensive evaluations across benchmark datasets demonstrate that our method significantly outperforms state-of-the-art approaches, offering a robust solution to machine unlearning. This work not only deepens understanding of unlearning mechanisms but also lays the foundation for effective machine unlearning with mixup augmentation.

Adversarial Mixup Unlearning

TL;DR

This paper tackles catastrophic unlearning, where erasing targeted data from a trained model can inadvertently degrade performance on retained knowledge. It introduces MixUnlearn, a generator–unlearner framework that uses an adversarial mixup generator to produce hard samples from Forgetting and Remaining data, coupled with two contrastive losses to regularize forgetting and retention. The method achieves superior or competitive unlearning performance across class- and data-level tasks on datasets like CIFAR-10, SVHN, MNIST, and Fashion-MNIST, with robustness to noisy and semi-supervised settings and notable efficiency gains via a lightweight MixBlock and periodic generator updates. Extensive analyses—ablations, representation visualizations, KDE loss-distributions, and large-scale ImageNet/Vit experiments—support MixUnlearn as an effective, scalable approach to approximate machine unlearning that mitigates catastrophic forgetting while preserving essential knowledge and generalization.

Abstract

Machine unlearning is a critical area of research aimed at safeguarding data privacy by enabling the removal of sensitive information from machine learning models. One unique challenge in this field is catastrophic unlearning, where erasing specific data from a well-trained model unintentionally removes essential knowledge, causing the model to deviate significantly from a retrained one. To address this, we introduce a novel approach that regularizes the unlearning process by utilizing synthesized mixup samples, which simulate the data susceptible to catastrophic effects. At the core of our approach is a generator-unlearner framework, MixUnlearn, where a generator adversarially produces challenging mixup examples, and the unlearner effectively forgets target information based on these synthesized data. Specifically, we first introduce a novel contrastive objective to train the generator in an adversarial direction: generating examples that prompt the unlearner to reveal information that should be forgotten, while losing essential knowledge. Then the unlearner, guided by two other contrastive loss terms, processes the synthesized and real data jointly to ensure accurate unlearning without losing critical knowledge, overcoming catastrophic effects. Extensive evaluations across benchmark datasets demonstrate that our method significantly outperforms state-of-the-art approaches, offering a robust solution to machine unlearning. This work not only deepens understanding of unlearning mechanisms but also lays the foundation for effective machine unlearning with mixup augmentation.

Paper Structure

This paper contains 30 sections, 8 equations, 11 figures, 7 tables.

Figures (11)

  • Figure 1: A toy example of unlearning a focal class (in orange) to motivate our solution. The shaded orange represents the effects of forgetting, while the shaded blue indicates the retention of remaining knowledge. This example demonstrates how forgetting can inadvertently harm the knowledge that should be retained, as shown by the overlapping shaded regions, ultimately degrading the generalizability on unseen data (shown as stars). Notably, we can sythesize mixup samples—which are strategically mixed from Forgetting and Remaining—to mimic the data vulnerable to the overlapping catastrophic effects. By removing the forgetting effects and retaining remaining knowledge on these mixup samples, we can overcome catastrophic unlearning.
  • Figure 2: The goal of our generator is to produce hard mixed samples that challenge the unlearner. These mixed samples can prompt the unlearner to reveal information related to Forgetting and simultaneously disrupt the retention of Remaining, reversing the direction of unlearning process (the goal of unlearning is to forget Forgetting while preserving Remaining). To train the generator, we hold the unlearner fixed and update the generator's parameters with a proposed contrastive objective of Eq. \ref{['eq:optimize_g']}. The unlearner then performs effective unlearning based on these mixed samples.
  • Figure 3: Representation distributions in the class-level unlearning on the CIFAR-10. The blue denotes forgetting class (class 0; airplane) while the other colors denote the remaining data. The involved classes are: airplane, automobile, bird, cat, deer, dog, frog, horse, ship and truck.
  • Figure 4: Kernel Density Estimate Plots for Loss Distributions. We use the setting of class-level unlearning on CIFAR10. The horizontal axis is CrossEntropyLoss value and the vertical is density.
  • Figure 5: Learning Curve of Initial Model.
  • ...and 6 more figures