Differentially Private Decentralized Learning with Random Walks
Edwige Cyffers, Aurélien Bellet, Jalaj Upadhyay
TL;DR
This work addresses privacy leakage in fully decentralized learning where model updates propagate via random walks on a graph. It introduces Pairwise Network Differential Privacy (PNDP) to capture pairwise privacy between node pairs and proposes a private random-walk SGD (RW-DP-SGD) that adds Gaussian noise at each local update. A central result is a closed-form PNDP bound for arbitrary graphs, showing that the privacy loss between nodes $u$ and $v$ after $T$ iterations depends on the graph topology through a matrix-log term, linking leakage to graph communicability. Empirical results on synthetic and real networks demonstrate that random-walk based updates can achieve better privacy-utility trade-offs than gossip algorithms, especially for closely connected nodes, with the analysis providing interpretable topology-driven guarantees that bridge local and central privacy models.
Abstract
The popularity of federated learning comes from the possibility of better scalability and the ability for participants to keep control of their data, improving data security and sovereignty. Unfortunately, sharing model updates also creates a new privacy attack surface. In this work, we characterize the privacy guarantees of decentralized learning with random walk algorithms, where a model is updated by traveling from one node to another along the edges of a communication graph. Using a recent variant of differential privacy tailored to the study of decentralized algorithms, namely Pairwise Network Differential Privacy, we derive closed-form expressions for the privacy loss between each pair of nodes where the impact of the communication topology is captured by graph theoretic quantities. Our results further reveal that random walk algorithms tends to yield better privacy guarantees than gossip algorithms for nodes close from each other. We supplement our theoretical results with empirical evaluation on synthetic and real-world graphs and datasets.
