Privacy-Preserving Federated Learning with Consistency via Knowledge Distillation Using Conditional Generator
Kangyang Luo, Shuai Wang, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu
TL;DR
This work tackles privacy leakage in Federated Learning by replacing per-client feature extractors with conditional generators and enforcing consistency through knowledge distillation at both latent-feature and logit levels. It introduces a two-stage client-side distillation and a crossed server-side distillation scheme, augmented with diversity constraints to prevent mode collapse, all without training extra discriminators. Empirical results on EMNIST, FMNIST, and CIFAR-10 show FedMD-CG delivering competitive accuracy while providing stronger privacy protection than full-sharing baselines, and ablations confirm the effectiveness of the proposed distillation and aggregation strategies. The proposed approach offers a practical pathway to high-performance, privacy-conscious FL in heterogeneous data settings, with clear directions for efficiency improvements and scaling.
Abstract
Federated Learning (FL) is gaining popularity as a distributed learning framework that only shares model parameters or gradient updates and keeps private data locally. However, FL is at risk of privacy leakage caused by privacy inference attacks. And most existing privacy-preserving mechanisms in FL conflict with achieving high performance and efficiency. Therefore, we propose FedMD-CG, a novel FL method with highly competitive performance and high-level privacy preservation, which decouples each client's local model into a feature extractor and a classifier, and utilizes a conditional generator instead of the feature extractor to perform server-side model aggregation. To ensure the consistency of local generators and classifiers, FedMD-CG leverages knowledge distillation to train local models and generators at both the latent feature level and the logit level. Also, we construct additional classification losses and design new diversity losses to enhance client-side training. FedMD-CG is robust to data heterogeneity and does not require training extra discriminators (like cGAN). We conduct extensive experiments on various image classification tasks to validate the superiority of FedMD-CG.
