Differentially-Private Multi-Tier Federated Learning: A Formal Analysis and Evaluation
Evan Chen, Frank Po-Chen Lin, Dong-Jun Han, Christopher G. Brinton
TL;DR
<3-5 sentence high-level summary>: The paper tackles the challenge of preserving privacy in multi-tier federated learning by integrating differential privacy across edge, fog, and cloud layers with heterogeneous trust models. It proposes M^2FDP, a DP-enhanced MFL framework that adaptively injects noise across the network hierarchy, and provides a non-convex convergence analysis showing sublinear convergence to a controllable stationary region influenced by trust and topology. An adaptive control algorithm is developed to jointly optimize step size, local training intervals, and participant selection to balance energy, latency, and accuracy while meeting DP guarantees. Empirical results demonstrate significant improvements in convergence speed, energy efficiency, and latency over baselines, especially when secure intermediate nodes are prevalent. These results highlight the practical viability of privacy-preserving, multi-tier ML in heterogeneous edge/fog/cloud deployments.
Abstract
While federated learning (FL) eliminates the transmission of raw data over a network, it is still vulnerable to privacy breaches from the communicated model parameters. Differential privacy (DP) is often employed to address such issues. However, the impact of DP on FL in multi-tier networks -- where hierarchical aggregations couple noise injection decisions at different tiers, and trust models are heterogeneous across subnetworks -- is not well understood. To fill this gap, we develop \underline{M}ulti-Tier \underline{F}ederated Learning with \underline{M}ulti-Tier \underline{D}ifferential \underline{P}rivacy ({\tt M$^2$FDP}), a DP-enhanced FL methodology for jointly optimizing privacy and performance over such networks. One of the key principles of {\tt M$^2$FDP} is to adapt DP noise injection across the established edge/fog computing hierarchy (e.g., edge devices, intermediate nodes, and other tiers up to cloud servers) according to the trust models in different subnetworks. We conduct a comprehensive analysis of the convergence behavior of {\tt M$^2$FDP} under non-convex problem settings, revealing conditions on parameter tuning under which the training process converges sublinearly to a finite stationarity gap that depends on the network hierarchy, trust model, and target privacy level. We show how these relationships can be employed to develop an adaptive control algorithm for {\tt M$^2$FDP} that tunes properties of local model training to minimize energy, latency, and the stationarity gap while meeting desired convergence and privacy criterion. Subsequent numerical evaluations demonstrate that {\tt M$^2$FDP} obtains substantial improvements in these metrics over baselines for different privacy budgets and system configurations.
