Fully Decentralized Certified Unlearning
Hithem Lamri, Michail Maniatakos
TL;DR
The paper tackles certified unlearning in fully decentralized networks with fixed topologies by introducing RR-DU, a random-walk based unlearning algorithm. RR-DU localizes noise to the unlearning client and leverages randomized routing, local averaging, and a trust-region constraint to achieve strong privacy-utility trade-offs, with network-DP view-based certificates and convergence guarantees. The authors establish two regimes for deletion capacity, show that RR-DU avoids the forget-set size dependence that plagues decentralized DP baselines, and provide both theoretical results (last-iterate guarantees, stationarity, and deletion-capacity bounds) and empirical validation on MNIST and CIFAR-10 demonstrating near-scratch forgetting with superior retained accuracy. These contributions enable practical, scalable certified unlearning in decentralized settings, with explicit guidance on routing, averaging, and noise calibration for real-world deployment.
Abstract
Machine unlearning (MU) seeks to remove the influence of specified data from a trained model in response to privacy requests or data poisoning. While certified unlearning has been analyzed in centralized and server-orchestrated federated settings (via guarantees analogous to differential privacy, DP), the decentralized setting -- where peers communicate without a coordinator remains underexplored. We study certified unlearning in decentralized networks with fixed topologies and propose RR-DU, a random-walk procedure that performs one projected gradient ascent step on the forget set at the unlearning client and a geometrically distributed number of projected descent steps on the retained data elsewhere, combined with subsampled Gaussian noise and projection onto a trust region around the original model. We provide (i) convergence guarantees in the convex case and stationarity guarantees in the nonconvex case, (ii) $(\varepsilon,δ)$ network-unlearning certificates on client views via subsampled Gaussian Rényi DP (RDP) with segment-level subsampling, and (iii) deletion-capacity bounds that scale with the forget-to-local data ratio and quantify the effect of decentralization (network mixing and randomized subsampling) on the privacy-utility trade-off. Empirically, on image benchmarks (MNIST, CIFAR-10), RR-DU matches a given $(\varepsilon,δ)$ while achieving higher test accuracy than decentralized DP baselines and reducing forget accuracy to random guessing ($\approx 10\%$).
