Building Gradient Bridges: Label Leakage from Restricted Gradient Sharing in Federated Learning
Rui Zhang, Ka-Ho Chow, Ping Li
TL;DR
Federated learning (FL) exposes a privacy risk through gradient sharing: shared gradients can leak the label distribution of a client's private data. The authors present Gradient Bridge (GDBR), an attack that constructs a bridge from the accessible bottom-layer gradients to the output logits, enabling recovery of batch label counts from limited gradient information, aided by batch-averaged gradients and auxiliary data. The method recovers labels robustly across multiple datasets and architectures (e.g., LeNet on MNIST, AlexNet on SVHN, VGG11 on CIFAR-10, ResNet18 on CIFAR-100) and remains effective under gradient pruning and noise perturbation, highlighting weaknesses in lightweight defenses. The work underscores the need for stronger privacy protections in FL and provides a framework for future defense design and evaluation.
Abstract
The growing concern over data privacy, the benefits of utilizing data from diverse sources for model training, and the proliferation of networked devices with enhanced computational capabilities have all contributed to the rise of federated learning (FL). The clients in FL collaborate to train a global model by uploading gradients computed on their private datasets, so no raw data is collected. However, gradient sharing has opened a new attack surface: adversaries can recover the label distribution of a victim's private data by analyzing the obtained gradients. To mitigate this privacy leakage, existing lightweight defenses restrict the sharing of gradients, for example by encrypting the final-layer gradients or by updating the final-layer parameters locally. In this paper, we introduce a novel attack called Gradient Bridge (GDBR) that recovers the label distribution of training data from the limited gradient information shared in FL. GDBR explores the relationship between the layer-wise gradients, tracks the flow of gradients, and analytically derives the batch training labels. Extensive experiments show that GDBR can accurately recover more than 80% of labels in various FL settings. GDBR highlights the inadequacy of defenses based on restricted gradient sharing and calls for the design of effective defense schemes in FL.
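To make the leakage concrete, the sketch below illustrates the well-known final-layer case that the restricted-sharing defenses are meant to hide. It is not GDBR itself, and it does not reproduce the bridge from bottom-layer gradients. It assumes a softmax cross-entropy loss, for which the batch-averaged gradient with respect to the final-layer bias equals the mean of (softmax probabilities minus one-hot labels). The function names and the toy batch are hypothetical, and the sketch assumes the attacker knows the per-class sum of softmax probabilities exactly, which a real attacker would have to estimate.

```python
import math
import random

def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def bias_gradient(logits, labels, num_classes):
    """Batch-averaged gradient of softmax cross-entropy w.r.t. the final-layer bias.

    For each sample the gradient w.r.t. the logits is (p - onehot(y));
    averaging over the batch gives the bias gradient.
    """
    batch_size = len(labels)
    grad = [0.0] * num_classes
    for z, y in zip(logits, labels):
        p = softmax(z)
        for c in range(num_classes):
            grad[c] += (p[c] - (1.0 if c == y else 0.0)) / batch_size
    return grad

def recover_counts(grad, prob_sum, batch_size):
    """Invert the identity: count_c = sum_i p_ic - B * grad_c."""
    return [round(ps - batch_size * g) for ps, g in zip(prob_sum, grad)]

# Toy batch (hypothetical values, for illustration only).
random.seed(0)
num_classes, batch_size = 4, 8
labels = [random.randrange(num_classes) for _ in range(batch_size)]
logits = [[random.gauss(0, 1) for _ in range(num_classes)] for _ in labels]

grad = bias_gradient(logits, labels, num_classes)

# Assumption: the per-class probability sums are known exactly here.
prob_sum = [sum(softmax(z)[c] for z in logits) for c in range(num_classes)]

recovered = recover_counts(grad, prob_sum, batch_size)
true_counts = [labels.count(c) for c in range(num_classes)]
print(recovered, true_counts)
```

With exact probability sums, the recovered counts match the true label counts, which is why hiding only the final-layer gradient looks like a natural lightweight defense. The paper's claim is that this restriction is not enough, because the same information can still be reached from the gradients that remain shared.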
