Toward Efficient Data-Free Unlearning
Chenhao Zhang, Shaofei Shen, Weitong Chen, Miao Xu
TL;DR
Toward Efficient Data-Free Unlearning addresses the challenge of forgetting without access to real data by analyzing inefficiencies in data-free distillation and introducing ISPF, a two-component framework. Inhibited Synthesis reduces synthesis of forgetting-class information, while PostFilter fully leverages retaining-class information from all synthetic samples during distillation. Across SVHN, CIFAR-10, and CIFAR-100, ISPF consistently improves retaining accuracy, reduces forgetting, strengthens unlearning guarantees, and achieves faster training efficiency than prior methods. The results demonstrate that enriching retaining-class information and exploiting all synthetic data are effective strategies for data-free unlearning with practical impact for privacy-preserving model maintenance.
Abstract
Machine unlearning without access to real data distribution is challenging. The existing method based on data-free distillation achieved unlearning by filtering out synthetic samples containing forgetting information but struggled to distill the retaining-related knowledge efficiently. In this work, we analyze that such a problem is due to over-filtering, which reduces the synthesized retaining-related information. We propose a novel method, Inhibited Synthetic PostFilter (ISPF), to tackle this challenge from two perspectives: First, the Inhibited Synthetic, by reducing the synthesized forgetting information; Second, the PostFilter, by fully utilizing the retaining-related information in synthesized samples. Experimental results demonstrate that the proposed ISPF effectively tackles the challenge and outperforms existing methods.
