KoReA-SFL: Knowledge Replay-based Split Federated Learning Against Catastrophic Forgetting
Zeke Xia, Ming Hu, Dengke Yan, Ruixuan Liu, Anran Li, Xiaofei Xie, Mingsong Chen
TL;DR
KoReA-SFL addresses non-IID data-induced gradient divergence and catastrophic forgetting in Split Federated Learning by maintaining multiple branch server-side and client-side portions with a shared master model for knowledge sharing. It introduces a knowledge replay mechanism that solicitously samples underrepresented category features from assistant devices based on per-branch data-distribution scores, combined with an adaptive sampling proportion governed by the Federated Gradient Norm. The approach is supported by a convergence analysis under standard assumptions and empirical results showing substantial accuracy gains (up to about 23% or more in challenging non-IID settings) across diverse datasets and models, along with ablation studies validating the contributions of knowledge replay and adaptive sampling. Overall, KoReA-SFL demonstrates improved robustness, stability, and generalization in resource-constrained, heterogeneous federated environments, enabling more effective SFL deployments.
Abstract
Although Split Federated Learning (SFL) is good at enabling knowledge sharing among resource-constrained clients, it suffers from the problem of low training accuracy due to the neglect of data heterogeneity and catastrophic forgetting. To address this issue, we propose a novel SFL approach named KoReA-SFL, which adopts a multi-model aggregation mechanism to alleviate gradient divergence caused by heterogeneous data and a knowledge replay strategy to deal with catastrophic forgetting. Specifically, in KoReA-SFL cloud servers (i.e., fed server and main server) maintain multiple branch model portions rather than a global portion for local training and an aggregated master-model portion for knowledge sharing among branch portions. To avoid catastrophic forgetting, the main server of KoReA-SFL selects multiple assistant devices for knowledge replay according to the training data distribution of each server-side branch-model portion. Experimental results obtained from non-IID and IID scenarios demonstrate that KoReA-SFL significantly outperforms conventional SFL methods (by up to 23.25\% test accuracy improvement).
