Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data
Rongyu Zhang, Yun Chen, Chenrui Wu, Fangxin Wang, Bo Li
TL;DR
This work tackles non-i.i.d. and long-tailed data in federated learning for autonomous driving by introducing MuPFL, a three-level personalized framework. Local Biased Activation Value Dropout (BAVD) reduces overfitting and speeds training, intermediate Adaptive Cluster-based Model Update (ACMU) dynamically clusters and pre-updates similar models, and the central server's Prior Knowledge-assisted Classifier Fine-tuning (PKCF) injects global knowledge into local classifiers to improve tail-class performance. The approach yields consistent accuracy improvements over strong baselines and substantially faster convergence, as demonstrated on image classification and Cityscapes semantic segmentation tasks, with ablations confirming the contributions of each module. Overall, MuPFL offers a scalable, efficient path to robust personalized learning under extreme data heterogeneity, enabling better real-world deployment in autonomous systems.
Abstract
Federated learning (FL) offers a privacy-centric distributed learning framework, enabling model training on individual clients and central aggregation without necessitating data exchange. Nonetheless, FL implementations often suffer from non-i.i.d. and long-tailed class distributions across mobile applications, e.g., autonomous vehicles, which leads models to overfitting as local training may converge to sub-optimal. In our study, we explore the impact of data heterogeneity on model bias and introduce an innovative personalized FL framework, Multi-level Personalized Federated Learning (MuPFL), which leverages the hierarchical architecture of FL to fully harness computational resources at various levels. This framework integrates three pivotal modules: Biased Activation Value Dropout (BAVD) to mitigate overfitting and accelerate training; Adaptive Cluster-based Model Update (ACMU) to refine local models ensuring coherent global aggregation; and Prior Knowledge-assisted Classifier Fine-tuning (PKCF) to bolster classification and personalize models in accord with skewed local data with shared knowledge. Extensive experiments on diverse real-world datasets for image classification and semantic segmentation validate that MuPFL consistently outperforms state-of-the-art baselines, even under extreme non-i.i.d. and long-tail conditions, which enhances accuracy by as much as 7.39% and accelerates training by up to 80% at most, marking significant advancements in both efficiency and effectiveness.
