Differential Privacy as a Perk: Federated Learning over Multiple-Access Fading Channels with a Multi-Antenna Base Station
Hao Liang, Haifeng Wen, Kaishun Wu, Hong Xing
TL;DR
This work studies AirFL with a multi‑antenna base station under user‑level differential privacy, showing that channel noise can provide DP without artificial noise in many scenarios. It derives a convergent DP bound for general smooth, non‑convex losses under bounded parameter domains and formulates an optimization over receive beamforming and power to balance convergence and privacy. The authors prove that zero artificial noise is optimal in many cases and identify explicit conditions where DP is gained as a perk, then validate these results with Fashion‑MNIST experiments demonstrating privacy‑convergence trade‑offs and DP‑for‑free in low‑SNR settings. The findings offer a practical, scalable pathway to privacy‑preserving wireless FL that leverages physical layer randomness to achieve strong guarantees without sacrificing training performance, with broad implications for edge learning in wireless networks.
Abstract
Federated Learning (FL) is a distributed learning paradigm that preserves privacy by eliminating the need to exchange raw data during training. In its prototypical edge instantiation with underlying wireless transmissions enabled by analog over-the-air computing (AirComp), referred to as \emph{over-the-air FL (AirFL)}, the inherent channel noise plays a unique role of \emph{frenemy} in the sense that it degrades training due to noisy global aggregation while providing a natural source of randomness for privacy-preserving mechanisms, formally quantified by \emph{differential privacy (DP)}. It remains, nevertheless, challenging to effectively harness such channel impairments, as prior arts, under assumptions of either simple channel models or restricted types of loss functions, mostly considering (local) DP enhancement with a single-round or non-convergent bound on privacy loss. In this paper, we study AirFL over multiple-access fading channels with a multi-antenna base station (BS) subject to user-level DP requirements. Despite a recent study, which claimed in similar settings that artificial noise (AN) must be injected to ensure DP in general, we demonstrate, on the contrary, that DP can be gained as a \emph{perk} even \emph{without} employing any AN. Specifically, we derive a novel bound on DP that converges under general bounded-domain assumptions on model parameters, along with a convergence bound with general smooth and non-convex loss functions. Next, we optimize over receive beamforming and power allocations to characterize the optimal convergence-privacy trade-offs, which also reveal explicit conditions in which DP is achievable without compromising training. Finally, our theoretical findings are validated by extensive numerical results.
