Table of Contents
Fetching ...

pFedLoRA: Model-Heterogeneous Personalized Federated Learning with LoRA Tuning

Liping Yi, Han Yu, Gang Wang, Xiaoguang Liu, Xiaoxiao Li

TL;DR

This paper tackles model heterogeneity in federated learning by proposing FedLoRA, which inserts a small, homogeneous, low-rank adapter into heterogeneous local models to enable cross-client knowledge transfer. An iterative learning scheme trains the heterogeneous models and adapters in alternating phases, followed by FedAvg-style aggregation of adapters to form a global adapter. The authors prove a non-convex convergence rate of $O(1/T)$ and demonstrate that FedLoRA achieves higher accuracy with far lower computation ($O(11.81 imes$) and communication ($O(7.41 imes$) savings) than baselines on CIFAR-10/100 under non-IID conditions. The approach enables privacy-preserving, communication-efficient, model-heterogeneous personalized FL with robust performance across varying data distributions and client participation.

Abstract

Federated learning (FL) is an emerging machine learning paradigm in which a central server coordinates multiple participants (clients) collaboratively to train on decentralized data. In practice, FL often faces statistical, system, and model heterogeneities, which inspires the field of Model-Heterogeneous Personalized Federated Learning (MHPFL). With the increased interest in adopting large language models (LLMs) in FL, the existing MHPFL methods cannot achieve acceptable computational and communication costs, while maintaining satisfactory model performance. To bridge this gap, we propose a novel and efficient model-heterogeneous personalized Federated learning framework based on LoRA tuning (pFedLoRA). Inspired by the popular LoRA method for fine-tuning pre-trained LLMs with a low-rank model (a.k.a., an adapter), we design a homogeneous small adapter to facilitate federated client's heterogeneous local model training with our proposed iterative training for global-local knowledge exchange. The homogeneous small local adapters are aggregated on the FL server to generate a global adapter. We theoretically prove the convergence of pFedLoRA. Extensive experiments on two benchmark datasets demonstrate that pFedLoRA outperforms six state-of-the-art baselines, beating the best method by 1.35% in test accuracy, 11.81 times computation overhead reduction and 7.41 times communication cost saving.

pFedLoRA: Model-Heterogeneous Personalized Federated Learning with LoRA Tuning

TL;DR

This paper tackles model heterogeneity in federated learning by proposing FedLoRA, which inserts a small, homogeneous, low-rank adapter into heterogeneous local models to enable cross-client knowledge transfer. An iterative learning scheme trains the heterogeneous models and adapters in alternating phases, followed by FedAvg-style aggregation of adapters to form a global adapter. The authors prove a non-convex convergence rate of and demonstrate that FedLoRA achieves higher accuracy with far lower computation () and communication () savings) than baselines on CIFAR-10/100 under non-IID conditions. The approach enables privacy-preserving, communication-efficient, model-heterogeneous personalized FL with robust performance across varying data distributions and client participation.

Abstract

Federated learning (FL) is an emerging machine learning paradigm in which a central server coordinates multiple participants (clients) collaboratively to train on decentralized data. In practice, FL often faces statistical, system, and model heterogeneities, which inspires the field of Model-Heterogeneous Personalized Federated Learning (MHPFL). With the increased interest in adopting large language models (LLMs) in FL, the existing MHPFL methods cannot achieve acceptable computational and communication costs, while maintaining satisfactory model performance. To bridge this gap, we propose a novel and efficient model-heterogeneous personalized Federated learning framework based on LoRA tuning (pFedLoRA). Inspired by the popular LoRA method for fine-tuning pre-trained LLMs with a low-rank model (a.k.a., an adapter), we design a homogeneous small adapter to facilitate federated client's heterogeneous local model training with our proposed iterative training for global-local knowledge exchange. The homogeneous small local adapters are aggregated on the FL server to generate a global adapter. We theoretically prove the convergence of pFedLoRA. Extensive experiments on two benchmark datasets demonstrate that pFedLoRA outperforms six state-of-the-art baselines, beating the best method by 1.35% in test accuracy, 11.81 times computation overhead reduction and 7.41 times communication cost saving.
Paper Structure (23 sections, 2 theorems, 27 equations, 10 figures, 3 tables, 1 algorithm)

This paper contains 23 sections, 2 theorems, 27 equations, 10 figures, 3 tables, 1 algorithm.

Key Result

Lemma 5.3

Based on Assumptions assump:Lipschitz and assump:Unbiased, during $\{0,1,...,E\}$ local iterations of the $(t+1)$-th FL training round, the loss of an arbitrary client's local heterogeneous model is bounded by:

Figures (10)

  • Figure 1: The working principle of LoRA.
  • Figure 2: Workflow of FedLoRA.
  • Figure 3: Iterative learning in FedLoRA.
  • Figure 4: Two types of low-rank adapters.
  • Figure 5: Accuracy distribution for individual clients.
  • ...and 5 more figures

Theorems & Definitions (4)

  • Lemma 5.3
  • Theorem 5.4
  • proof
  • proof