SCALA: Split Federated Learning with Concatenated Activations and Logit Adjustments
Jiarong Yang, Yuan Liu
TL;DR
The paper tackles label distribution skew in Split Federated Learning caused by data heterogeneity and partial participation. It proposes SCALA, which centralizes server-side training on concatenated client activations and applies logit-adjusted losses to balance learning across skewed labels. Theoretical convergence guarantees and extensive experiments demonstrate robust improvements over baselines across multiple datasets and participation scenarios, including a privacy-enhanced variant. These findings highlight SCALA’s practical potential for scalable, skew-resilient distributed learning with explicit mechanisms to address both local and global label distribution challenges.
Abstract
Split Federated Learning (SFL) is a distributed machine learning framework which strategically divides the learning process between a server and clients and collaboratively trains a shared model by aggregating local models updated based on data from distributed clients. However, data heterogeneity and partial client participation result in label distribution skew, which severely degrades the learning performance. To address this issue, we propose SFL with Concatenated Activations and Logit Adjustments (SCALA). Specifically, the activations from the client-side models are concatenated as the input of the server-side model so as to centrally adjust label distribution across different clients, and logit adjustments of loss functions on both server-side and client-side models are performed to deal with the label distribution variation across different subsets of participating clients. Theoretical analysis and experimental results verify the superiority of the proposed SCALA on public datasets.
