Federated Unlearning: How to Efficiently Erase a Client in FL?
Anisa Halimi, Swanand Kadhe, Ambrish Rawat, Nathalie Baracaldo
TL;DR
The paper addresses how to remove the influence of a single client’s entire local dataset from a trained federated learning (FL) model. The target client first performs constrained local unlearning, using projected gradient descent to stay close to a reference model; the server and the remaining clients then run a few rounds of FL initialized from this locally unlearned model. This two-phase approach achieves results comparable to retraining from scratch while significantly reducing communication and computation costs, and it requires neither global data access nor stored update histories. Empirical evaluations on backdoor and flipped-image scenarios across MNIST, EMNIST, and CIFAR-10 show strong efficacy and fidelity along with substantial efficiency gains, which suggests practical applicability in cross-silo FL. The key novelty is formulating local unlearning as a constrained optimization problem around a reference model and using a minimal number of FL rounds to recover a robust, unlearned global model.
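The local unlearning step can be illustrated with a minimal sketch. It assumes several details that this summary does not spell out: that the client increases its own loss by gradient ascent, that the constraint is an L2 ball of radius `radius` around the reference model `w_ref`, and that the model is a toy logistic regression. All function names and hyperparameters here are hypothetical and chosen only for illustration.

```python
import numpy as np

def project_l2_ball(w, w_ref, radius):
    """Project w onto the L2 ball of the given radius centred at w_ref."""
    diff = w - w_ref
    norm = np.linalg.norm(diff)
    if norm <= radius:
        return w
    return w_ref + diff * (radius / norm)

def logistic_loss_and_grad(w, X, y):
    """Mean logistic loss and its gradient for labels y in {0, 1}."""
    p = 1.0 / (1.0 + np.exp(-(X @ w)))
    eps = 1e-12  # avoids log(0)
    loss = -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
    grad = X.T @ (p - y) / len(y)
    return loss, grad

def local_unlearn(w_ref, X, y, radius=1.0, lr=0.1, steps=50):
    """Assumed objective: raise the client's loss while staying near w_ref."""
    w = w_ref.copy()
    for _ in range(steps):
        _, grad = logistic_loss_and_grad(w, X, y)
        # Ascent step on the client's own loss, then projection back
        # into the ball around the reference model.
        w = project_l2_ball(w + lr * grad, w_ref, radius)
    return w
```

The projection is what keeps the unlearned model from drifting arbitrarily far from the reference: without it, maximizing the loss would simply destroy the model rather than remove one client's contribution.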
Abstract
With privacy legislation empowering users with the right to be forgotten, it has become essential to make models amenable to forgetting some of their training data. However, existing unlearning methods for machine learning cannot be directly applied to distributed settings such as federated learning, because of differences in the learning protocol and the presence of multiple actors. In this paper, we tackle the problem of federated unlearning for the case of erasing a client, that is, removing the influence of its entire local data from the trained global model. To erase a client, we propose to first perform local unlearning at the client to be erased, and then use the locally unlearned model as the initialization for a very small number of federated learning rounds between the server and the remaining clients, which yields the unlearned global model. We empirically evaluate our unlearning method using multiple performance measures on three datasets, and demonstrate that it achieves performance comparable to the gold-standard unlearning method of federated retraining from scratch, while being significantly more efficient. Unlike prior works, our unlearning method requires neither global access to the data used for training nor the history of parameter updates to be stored by the server or any of the clients.
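The second phase, a few rounds of federated learning among the remaining clients starting from the locally unlearned model, might look roughly like the sketch below. It is only an illustration under stated assumptions: the aggregation rule is taken to be sample-weighted federated averaging, the local trainer is a toy least-squares model, and the names `local_train`, `fedavg_round`, and `post_unlearning_training` are hypothetical.

```python
import numpy as np

def local_train(w, X, y, lr=0.1, epochs=5):
    """A few full-batch gradient steps on mean squared error (toy local trainer)."""
    w = w.copy()
    for _ in range(epochs):
        grad = 2.0 * X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w

def fedavg_round(w_global, clients):
    """One round: each client trains locally, the server averages by sample count."""
    updates = [local_train(w_global, X, y) for X, y in clients]
    sizes = np.array([len(y) for _, y in clients], dtype=float)
    return np.average(updates, axis=0, weights=sizes)

def post_unlearning_training(w_unlearned, remaining_clients, rounds=3):
    """Run a few FL rounds over the remaining clients, initialised at the unlearned model."""
    w = w_unlearned.copy()
    for _ in range(rounds):
        w = fedavg_round(w, remaining_clients)
    return w
```

Note that the erased client's data never appears in `remaining_clients`, and no update history is consulted: the only inputs are the unlearned initialization and the remaining clients' local data.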
