A New Federated Learning Framework Against Gradient Inversion Attacks
Pengxin Guo, Shuang Zeng, Wenhao Chen, Xiaodan Zhang, Weihong Ren, Yuyin Zhou, Liangqiong Qu
TL;DR
HyperFL tackles privacy leakage in federated learning under Gradient Inversion Attacks by replacing direct gradient sharing with hypernetwork-generated per-client feature extractors. The framework decomposes models into a shared hypernetwork-generated feature extractor and a private classifier head, uploading only hypernetwork parameters for aggregation, with adapters for large pretrained models (HyperFL-LPM) to enable scalability. Theoretical analysis shows an $O(1/\sqrt{T})$ convergence rate (with potential improvements under PL conditions), while experiments on EMNIST, Fashion-MNIST, CIFAR-10, and CINIC-10 demonstrate competitive accuracy and strong privacy against IG/ROG-style attacks, outperforming DP-based FL baselines. HyperFL offers a practical, scalable privacy-preserving FL paradigm that reduces leakage without heavy cryptographic overhead, making it attractive for heterogeneous and large-scale deployments.
Abstract
Federated Learning (FL) aims to protect data privacy by enabling clients to collectively train machine learning models without sharing their raw data. However, recent studies demonstrate that information exchanged during FL is subject to Gradient Inversion Attacks (GIA) and, consequently, a variety of privacy-preserving methods have been integrated into FL to thwart such attacks, such as Secure Multi-party Computing (SMC), Homomorphic Encryption (HE), and Differential Privacy (DP). Despite their ability to protect data privacy, these approaches inherently involve substantial privacy-utility trade-offs. By revisiting the key to privacy exposure in FL under GIA, which lies in the frequent sharing of model gradients that contain private data, we take a new perspective by designing a novel privacy preserve FL framework that effectively ``breaks the direct connection'' between the shared parameters and the local private data to defend against GIA. Specifically, we propose a Hypernetwork Federated Learning (HyperFL) framework that utilizes hypernetworks to generate the parameters of the local model and only the hypernetwork parameters are uploaded to the server for aggregation. Theoretical analyses demonstrate the convergence rate of the proposed HyperFL, while extensive experimental results show the privacy-preserving capability and comparable performance of HyperFL. Code is available at https://github.com/Pengxin-Guo/HyperFL.
