Federated Unlearning Made Practical: Seamless Integration via Negated Pseudo-Gradients
Alessio Mora, Carlo Mazzocca, Rebecca Montanari, Paolo Bellavista
TL;DR
This work tackles federated unlearning by introducing PUF, a method that treats client updates as pseudo-gradients and negates them to erase a target client’s influence without storing historical data or altering the standard FedAvg workflow. PUF operates in two modes—PUF-Regular and PUF-Special—offering seamless integration with regular FL rounds and concurrent unlearning requests, while maintaining task-agnosticity. Extensive experiments on CIFAR-10, CIFAR-100, and ProstateMRI demonstrate that PUF achieves superior forgetting effectiveness with substantially reduced communication, computation, and storage costs compared to state-of-the-art baselines. The approach preserves model utility after recovery and requires only minimal hyperparameter tuning, making it practical for real-world FL deployments. Overall, PUF provides a principled, scalable solution to the right to be forgotten in privacy-preserving machine learning contexts.
Abstract
The right to be forgotten is a fundamental principle of privacy-preserving regulations and extends to Machine Learning (ML) paradigms such as Federated Learning (FL). While FL enhances privacy by enabling collaborative model training without sharing private data, trained models still retain the influence of training data. Federated Unlearning (FU) methods recently proposed often rely on impractical assumptions for real-world FL deployments, such as storing client update histories or requiring access to a publicly available dataset. To address these constraints, this paper introduces a novel method that leverages negated Pseudo-gradients Updates for Federated Unlearning (PUF). Our approach only uses standard client model updates, which are employed during regular FL rounds, and interprets them as pseudo-gradients. When a client needs to be forgotten, we apply the negation of their pseudo-gradients, appropriately scaled, to the global model. Unlike state-of-the-art mechanisms, PUF seamlessly integrates with FL workflows, incurs no additional computational and communication overhead beyond standard FL rounds, and supports concurrent unlearning requests. We extensively evaluated the proposed method on two well-known benchmark image classification datasets (CIFAR-10 and CIFAR-100) and a real-world medical imaging dataset for segmentation (ProstateMRI), using three different neural architectures: two residual networks and a vision transformer. The experimental results across various settings demonstrate that PUF achieves state-of-the-art forgetting effectiveness and recovery time, without relying on any additional assumptions.
