Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses
Changyu Gao, Andrew Lowy, Xingyu Zhou, Stephen J. Wright
TL;DR
The paper tackles private federated learning in the absence of a trusted server under ISRL-DP, addressing heterogeneity across silos and the desire for fewer communication rounds. It introduces a localized ISRL-DP accelerated MB-SGD framework for smooth losses, achieving optimal excess risk in the heterogeneous setting with sharp, near-private lower-bound-matching communication complexity and improved gradient complexity. For nonsmooth losses, the authors develop smoothing-based and direct subgradient variants that preserve optimal ISRL-DP rates and offer favorable trade-offs between communication and computation. Theoretical results are complemented by MNIST-based experiments showing substantial practical gains over prior ISRL-DP methods, including robustness to unreliable communication. Overall, the work advances private FL by attaining error-optimal performance without assuming data homogeneity and by delivering improved efficiency in both communication and computation, with open questions about lower bounds and universal optimality across regimes.
Abstract
We revisit the problem of federated learning (FL) with private data from people who do not trust the server or other silos/clients. In this context, every silo (e.g. hospital) has data from several people (e.g. patients) and needs to protect the privacy of each person's data (e.g. health records), even if the server and/or other silos try to uncover this data. Inter-Silo Record-Level Differential Privacy (ISRL-DP) prevents each silo's data from being leaked, by requiring that silo i's communications satisfy item-level differential privacy. Prior work arXiv:2106.09779 characterized the optimal excess risk bounds for ISRL-DP algorithms with homogeneous (i.i.d.) silo data and convex loss functions. However, two important questions were left open: (1) Can the same excess risk bounds be achieved with heterogeneous (non-i.i.d.) silo data? (2) Can the optimal risk bounds be achieved with fewer communication rounds? In this paper, we give positive answers to both questions. We provide novel ISRL-DP FL algorithms that achieve the optimal excess risk bounds in the presence of heterogeneous silo data. Moreover, our algorithms are more communication-efficient than the prior state-of-the-art. For smooth loss functions, our algorithm achieves the optimal excess risk bound and has communication complexity that matches the non-private lower bound. Additionally, our algorithms are more computationally efficient than the previous state-of-the-art.
