HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast

Yiting Zheng; Bohan Lin; Jinqian Chen; Jihua Zhu

HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast

Yiting Zheng, Bohan Lin, Jinqian Chen, Jihua Zhu

TL;DR

The paper tackles the challenge of heterogeneous federated learning under limited communication by introducing HFedCKD, a framework that combines data-free knowledge distillation with inverse probability weighting and hierarchical bidirectional contrastive learning. It stabilizes global knowledge transfer when client participation is sparse and models are diverse by dynamically reweighting client contributions (IPWD) and aligning feature spaces while preserving personalization through Encode-Global Alignment and Decode-History Alignment. Key contributions include a data-free KD scheme that mitigates generator bias, a two-way contrastive learning mechanism across multiple layers, and comprehensive experiments on image and IoT datasets demonstrating robustness to low participation and non-IID data. The work has practical impact for deploying robust, privacy-preserving FL in real-world, resource-constrained environments with heterogeneous devices and variable participation.

Abstract

Most current federated learning frameworks are modeled as static processes, ignoring the dynamic characteristics of the learning system. Under the limited communication budget of the central server, the flexible model architecture of a large number of clients participating in knowledge transfer requires a lower participation rate, active clients have uneven contributions, and the client scale seriously hinders the performance of FL. We consider a more general and practical federation scenario and propose a system heterogeneous federation method based on data-free knowledge distillation and two-way contrast (HFedCKD). We apply the Inverse Probability Weighted Distillation (IPWD) strategy to the data-free knowledge transfer framework. The generator completes the data features of the nonparticipating clients. IPWD implements a dynamic evaluation of the prediction contribution of each client under different data distributions. Based on the antibiased weighting of its prediction loss, the weight distribution of each client is effectively adjusted to fairly integrate the knowledge of participating clients. At the same time, the local model is split into a feature extractor and a classifier. Through differential contrast learning, the feature extractor is aligned with the global model in the feature space, while the classifier maintains personalized decision-making capabilities. HFedCKD effectively alleviates the knowledge offset caused by a low participation rate under data-free knowledge distillation and improves the performance and stability of the model. We conduct extensive experiments on image and IoT datasets to comprehensively evaluate and verify the generalization and robustness of the proposed HFedCKD framework.

HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast

TL;DR

Abstract

HFedCKD: Toward Robust Heterogeneous Federated Learning via Data-free Knowledge Distillation and Two-way Contrast

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)