Privacy-Preserving, Dropout-Resilient Aggregation in Decentralized Learning
Ali Reza Ghavamipour, Benjamin Zi Hao Zhao, Fatih Turkmen
TL;DR
This work addresses the privacy and dropout challenges of decentralized learning by introducing three secret-sharing-based aggregation protocols that are dropout-resilient. It combines Shamir's Secret Sharing, LWE-based masking, and pairwise masking with DH key exchange to enable secure, scalable aggregation without a central server. Across MNIST, Fashion-MNIST, SVHN, and CIFAR-10, the methods achieve strong privacy guarantees while maintaining high accuracy, with PPDL-NV and PPDL-PW closely approaching the centralized baseline and PPDL-LWE offering robust privacy with competitive efficiency. The study demonstrates that privacy-preserving, dropout-tolerant decentralized learning can operate efficiently at scale, outperforming traditional secure aggregation approaches in many scenarios and providing practical guidance for deploying DL in privacy-sensitive, unreliable networks.
Abstract
Decentralized learning (DL) offers a novel paradigm in machine learning by distributing training across clients without central aggregation, enhancing scalability and efficiency. However, DL's peer-to-peer model raises challenges in protecting against inference attacks and privacy leaks. By forgoing central bottlenecks, DL demands privacy-preserving aggregation methods to protect data from 'honest but curious' clients and adversaries, maintaining network-wide privacy. Privacy-preserving DL faces the additional hurdle of client dropout, clients not submitting updates due to connectivity problems or unavailability, further complicating aggregation. This work proposes three secret sharing-based dropout resilience approaches for privacy-preserving DL. Our study evaluates the efficiency, performance, and accuracy of these protocols through experiments on datasets such as MNIST, Fashion-MNIST, SVHN, and CIFAR-10. We compare our protocols with traditional secret-sharing solutions across scenarios, including those with up to 1000 clients. Evaluations show that our protocols significantly outperform conventional methods, especially in scenarios with up to 30% of clients dropout and model sizes of up to $10^6$ parameters. Our approaches demonstrate markedly high efficiency with larger models, higher dropout rates, and extensive client networks, highlighting their effectiveness in enhancing decentralized learning systems' privacy and dropout robustness.
