Split Unlearning
Guangsheng Yu, Yanna Jiang, Qin Wang, Xu Wang, Baihe Ma, Caijun Sun, Wei Ni, Ren Ping Liu
TL;DR
This work presents SplitWiper, a SISA-aligned framework for Split Learning that decouples client/server propagation with a one-way-one-off scheme, enabling independent unlearning even when clients are unavailable. It further introduces SplitWiper+, which uses label expansion and DP-based masking to protect client labels during learning and unlearning. Across diverse datasets and modalities, SplitWiper achieves complete forgetting of unlearned labels with no degradation to retained labels and dramatic reductions in overhead, while SplitWiper+ preserves high label privacy against server-side inferences. The approach addresses regulatory 'right to forget' constraints in distributed SL and offers practical, privacy-preserving unlearning with strong empirical guarantees.
Abstract
We introduce Split Unlearning, a novel machine unlearning technology designed for Split Learning (SL), enabling the first-ever implementation of Sharded, Isolated, Sliced, and Aggregated (SISA) unlearning in SL frameworks. Particularly, the tight coupling between clients and the server in existing SL frameworks results in frequent bidirectional data flows and iterative training across all clients, violating the "Isolated" principle and making them struggle to implement SISA for independent and efficient unlearning. To address this, we propose SplitWiper with a new one-way-one-off propagation scheme, which leverages the inherently "Sharded" structure of SL and decouples neural signal propagation between clients and the server, enabling effective SISA unlearning even in scenarios with absent clients. We further design SplitWiper+ to enhance client label privacy, which integrates differential privacy and label expansion strategy to defend the privacy of client labels against the server and other potential adversaries. Experiments across diverse data distributions and tasks demonstrate that SplitWiper achieves 0% accuracy for unlearned labels, and 8% better accuracy for retained labels than non-SISA unlearning in SL. Moreover, the one-way-one-off propagation maintains constant overhead, reducing computational and communication costs by 99%. SplitWiper+ preserves 90% of label privacy when sharing masked labels with the server.
