Defending Against Data Reconstruction Attacks in Federated Learning: An Information Theory Approach
Qi Tan, Qi Li, Yi Zhao, Zhuotao Liu, Xiaobing Guo, Ke Xu
TL;DR
The paper tackles privacy leakage in Federated Learning caused by Data Reconstruction Attacks (DRA) by formulating an information-theoretic channel model that ties DRA success to the mutual information $I(\mathbf{D}; \mathbf{W})$ between local data $\mathbf{D}$ and transmitted parameters $\mathbf{W}$. It proves a lower bound linking reconstruction error to MI, $\mathbb{E}[\|\mathbf{D}-\hat{\mathbf{D}}(\mathbf{W})\|^2/d] \ge \frac{e^{2h(\mathbf{D})/d}}{2\pi e} e^{-2I(\mathbf{D}; \mathbf{W})/d}$, and introduces a channel-capacity based mechanism to bound leakage across training rounds. The authors then shift defense from parameter-space manipulation to data-space protection by mapping data to a noisy version $\widetilde{\mathbf{D}} = \mathbf{D}+\boldsymbol{\xi}$ and solving $f(\sigma)=\kappa$ to bound per-round information leakage $I(\mathbf{D}; \widetilde{\mathbf{W}}_o|\mathbf{W}_i)$. They present three data-space channel implementations—Natural, White, and Personalized—showing improved utility-privacy tradeoffs and compatibility with DP, compression, and large-batch strategies, with extensive experiments across real datasets validating the approach. Overall, the work provides a principled, information-theoretic framework for defending against DRA in FL through controlled information transfer and data-space transformations, yielding practical methods that balance privacy and utility.
Abstract
Federated Learning (FL) trains a black-box and high-dimensional model among different clients by exchanging parameters instead of direct data sharing, which mitigates the privacy leak incurred by machine learning. However, FL still suffers from membership inference attacks (MIA) or data reconstruction attacks (DRA). In particular, an attacker can extract the information from local datasets by constructing DRA, which cannot be effectively throttled by existing techniques, e.g., Differential Privacy (DP). In this paper, we aim to ensure a strong privacy guarantee for FL under DRA. We prove that reconstruction errors under DRA are constrained by the information acquired by an attacker, which means that constraining the transmitted information can effectively throttle DRA. To quantify the information leakage incurred by FL, we establish a channel model, which depends on the upper bound of joint mutual information between the local dataset and multiple transmitted parameters. Moreover, the channel model indicates that the transmitted information can be constrained through data space operation, which can improve training efficiency and the model accuracy under constrained information. According to the channel model, we propose algorithms to constrain the information transmitted in a single round of local training. With a limited number of training rounds, the algorithms ensure that the total amount of transmitted information is limited. Furthermore, our channel model can be applied to various privacy-enhancing techniques (such as DP) to enhance privacy guarantees against DRA. Extensive experiments with real-world datasets validate the effectiveness of our methods.
