A Closer Look at Personalized Fine-Tuning in Heterogeneous Federated Learning
Minghui Chen, Hrad Ghoukasian, Ruinan Jin, Zehua Wang, Sai Praneeth Karimireddy, Xiaoxiao Li
TL;DR
The paper tackles the challenge of balancing client personalization and global generalization in heterogeneous federated learning by adapting the LP-FT (Linear Probing followed by Fine-Tuning) strategy to post-hoc personalization (PFT). Through large-scale empirical studies across seven datasets and multiple PFT baselines, it demonstrates that LP-FT reduces federated feature distortion and achieves superior global and average performance compared to standard fine-tuning methods. The authors provide a theoretical framework using a two-layer linear model to show that LP-FT yields lower global loss under concept shift and under combined covariate-concept shifts, with a threshold on heterogeneity below which LP-FT dominates. Supplementary experiments corroborate the theory, including label-shift and distortion analyses, highlighting LP-FT as a robust, deployment-friendly solution for robust personalization in FL.
Abstract
Federated Learning (FL) enables decentralized, privacy-preserving model training but struggles to balance global generalization and local personalization due to non-identical data distributions across clients. Personalized Fine-Tuning (PFT), a popular post-hoc solution, fine-tunes the final global model locally but often overfits to skewed client distributions or fails under domain shifts. We propose adapting Linear Probing followed by full Fine-Tuning (LP-FT), a principled centralized strategy for alleviating feature distortion (Kumar et al., 2022), to the FL setting. Through systematic evaluation across seven datasets and six PFT variants, we demonstrate LP-FT's superiority in balancing personalization and generalization. Our analysis uncovers federated feature distortion, a phenomenon where local fine-tuning destabilizes globally learned features, and theoretically characterizes how LP-FT mitigates this via phased parameter updates. We further establish conditions (e.g., partial feature overlap, covariate-concept shift) under which LP-FT outperforms standard fine-tuning, offering actionable guidelines for deploying robust personalization in FL.
