Approximate Gradient Coding for Privacy-Flexible Federated Learning with Non-IID Data
Okko Makkonen, Sampo Niemelä, Camilla Hollanti, Serge Kas Hanna
TL;DR
The paper tackles federated learning under non-IID data and client stragglers by introducing a privacy-flexible paradigm in which a configurable portion of each client’s data is designated non-private. It combines a one-time offline randomized data sharing phase, which reduces label heterogeneity, with an approximate gradient coding scheme that tolerates stragglers, yielding an unbiased gradient estimator with a provable variance reduction. Theoretical results quantify how the expected heterogeneity shrinks by a factor that depends on the replication parameter $d$ and the privacy level $c$, and how the gradient variance depends on the straggler probability $p$, the data replication, and the privacy level. Empirical validation on MNIST demonstrates faster convergence and robustness to non-IID distributions when $(c,d)$ are chosen appropriately, illustrating a practical privacy-utility trade-off with a manageable offline cost.
Abstract
This work focuses on the challenges of non-IID data and stragglers/dropouts in federated learning. We introduce and explore a privacy-flexible paradigm that models parts of the clients' local data as non-private, offering a more versatile and business-oriented perspective on privacy. Within this framework, we propose a data-driven strategy for mitigating the effects of label heterogeneity and client straggling on federated learning. Our solution combines offline data sharing with approximate gradient coding. Through numerical simulations using the MNIST dataset, we demonstrate that our approach achieves a deliberate trade-off between privacy and utility, improving model convergence and accuracy while using an adaptable portion of non-private data.
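To make the idea of approximate gradient coding concrete, the following is a minimal sketch and not the paper's exact scheme. It assumes that every data point is shareable (so the privacy level $c$ is ignored), that each point is replicated on $d$ clients chosen uniformly at random, and that each client straggles independently with probability $p$. The function names `assign_replicas`, `client_messages`, and `server_estimate` are hypothetical. Under these assumptions, scaling the received messages by $1/(d(1-p))$ gives an estimator whose expectation equals the full gradient.

```python
import random

def assign_replicas(num_points, num_clients, d, rng):
    """Offline phase (assumed): place each data point on d distinct random clients."""
    return [rng.sample(range(num_clients), d) for _ in range(num_points)]

def client_messages(point_grads, placement, num_clients):
    """Each client sums the per-point gradients of the data it holds."""
    dim = len(point_grads[0])
    msgs = [[0.0] * dim for _ in range(num_clients)]
    for grad, holders in zip(point_grads, placement):
        for c in holders:
            for k in range(dim):
                msgs[c][k] += grad[k]
    return msgs

def server_estimate(msgs, d, p, rng):
    """Sum the messages of responding clients, scaled by 1 / (d * (1 - p)).

    Each client is assumed to straggle independently with probability p,
    so each of the d copies of a point arrives with probability 1 - p and
    the scaled sum is unbiased for the full gradient.
    """
    scale = 1.0 / (d * (1.0 - p))
    dim = len(msgs[0])
    estimate = [0.0] * dim
    for msg in msgs:
        if rng.random() >= p:  # this client responded in time
            for k in range(dim):
                estimate[k] += scale * msg[k]
    return estimate

if __name__ == "__main__":
    rng = random.Random(0)
    # Toy per-point gradients; a real system would compute these from a model.
    grads = [[1.0, 2.0] for _ in range(20)]
    placement = assign_replicas(num_points=20, num_clients=10, d=3, rng=rng)
    msgs = client_messages(grads, placement, num_clients=10)
    print(server_estimate(msgs, d=3, p=0.3, rng=rng))
```

A single round returns a noisy estimate; averaging over many rounds should approach the full gradient, and in this sketch the noise grows as $p$ increases.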
