Straggler-Resilient Differentially-Private Decentralized Learning
Yauhen Yakimenka, Chung-Wei Weng, Hsuan-Yin Lin, Eirik Rosnes, Jörg Kliewer
TL;DR
This work tackles the straggler problem in fully decentralized learning on a ring while preserving user privacy by extending network differential privacy (NDP) to account for total training latency. It introduces a skipping scheme and analyzes both a ring and a randomized ring, deriving convergence guarantees that align with the baseline results of prior SGD analyses and DP bounds via Rényi DP with privacy amplification by iteration. A key finding is that the DP leakage $\varepsilon_{\mathrm{skip}}$ scales linearly with the total number of updates (and thus with $h_{\max}$ under certain conditions), while randomizing the ring improves privacy amplification without sacrificing asymptotic convergence. Empirical validation on OpenML housing data for logistic regression and on MNIST/CIFAR-10 demonstrates practical latency reductions and a quantifiable privacy-utility trade-off, illustrating how to tune the skip timeout to balance speed, accuracy, and privacy in decentralized learning systems.
Abstract
We consider the straggler problem in decentralized learning over a logical ring while preserving user data privacy. Especially, we extend the recently proposed framework of differential privacy (DP) amplification by decentralization by Cyffers and Bellet to include overall training latency--comprising both computation and communication latency. Analytical results on both the convergence speed and the DP level are derived for both a skipping scheme (which ignores the stragglers after a timeout) and a baseline scheme that waits for each node to finish before the training continues. A trade-off between overall training latency, accuracy, and privacy, parameterized by the timeout of the skipping scheme, is identified and empirically validated for logistic regression on a real-world dataset and for image classification using the MNIST and CIFAR-10 datasets.
