LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
Jieming Bian, Lei Wang, Letian Zhang, Jie Xu
TL;DR
This paper tackles the high cost of fine-tuning foundation models by combining LoRA with Federated Learning to preserve privacy and reduce parameters. It identifies two key issues in federated LoRA: server-side aggregation bias and client-side initialization lag, and proposes LoRA-FAIR, which refines server-side aggregation with a residual correction $\Delta \mathbf{B}$ to form $\bar{\mathbf{B}}'$, while applying Avg-Initial initialization to preserve cross-round information. The approach achieves superior performance and favorable communication costs on ViT and MLP-Mixer across DomainNet and NICO++ under feature non-IID settings, outperforming state-of-the-art baselines like FedIT, FFA-LoRA, FLoRA, and FlexLoRA. Theoretical guarantees are provided showing a contraction in the residual aggregation error and improved convergence bounds in non-IID settings, supporting the practical benefit of LoRA-FAIR for scalable federated fine-tuning of large models.
Abstract
Foundation models (FMs) achieve strong performance across diverse tasks with task-specific fine-tuning, yet full parameter fine-tuning is often computationally prohibitive for large models. Parameter-efficient fine-tuning (PEFT) methods like Low-Rank Adaptation (LoRA) reduce this cost by introducing low-rank matrices for tuning fewer parameters. While LoRA allows for efficient fine-tuning, it requires significant data for adaptation, making Federated Learning (FL) an appealing solution due to its privacy-preserving collaborative framework. However, combining LoRA with FL introduces two key challenges: the \textbf{Server-Side Aggregation Bias}, where server-side averaging of LoRA matrices diverges from the ideal global update, and the \textbf{Client-Side Initialization Lag}, emphasizing the need for consistent initialization across rounds. Existing approaches address these challenges individually, limiting their effectiveness. We propose LoRA-FAIR, a novel method that tackles both issues by introducing a correction term on the server, enhancing aggregation efficiency and accuracy. LoRA-FAIR maintains computational and communication efficiency, yielding superior performance over state-of-the-art methods. Experimental results on ViT and MLP-Mixer models across large-scale datasets demonstrate that LoRA-FAIR consistently achieves performance improvements in FL settings.
