Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA

Shuangyi Chen; Yue Ju; Hardik Dalal; Zhongwen Zhu; Ashish Khisti

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA

Shuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish Khisti

TL;DR

RoLoRA is introduced, a robust federated fine-tuning framework that utilizes an alternating minimization approach for LoRA, providing greater robustness against decreasing fine-tuning parameters and increasing data heterogeneity, and substantially enhances the robustness and effectiveness in multiple federated fine-tuning scenarios.

Abstract

Parameter-Efficient Fine-Tuning (PEFT) has risen as an innovative training strategy that updates only a select few model parameters, significantly lowering both computational and memory demands. PEFT also helps to decrease data transfer in federated learning settings, where communication depends on the size of updates. In this work, we explore the constraints of previous studies that integrate a well-known PEFT method named LoRA with federated fine-tuning, then introduce RoLoRA, a robust federated fine-tuning framework that utilizes an alternating minimization approach for LoRA, providing greater robustness against decreasing fine-tuning parameters and increasing data heterogeneity. Our results indicate that RoLoRA not only presents the communication benefits but also substantially enhances the robustness and effectiveness in multiple federated fine-tuning scenarios.

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA

TL;DR

Abstract

Paper Structure (29 sections, 2 equations, 2 figures, 8 tables)

This paper contains 29 sections, 2 equations, 2 figures, 8 tables.

Introduction
Related Work
Preliminaries
Low-Rank Adaptation: LoRA
FedAVG of LoRA Introduces Interference
FedRep: Common Representation via Alternating Minimization
Our Framework
Alternating Minimization and Corresponding Aggregation
Computation and Communication Cost
Experiments
Baselines
Model and Datasets.
Implementation.
Effect of Number of Finetuning Parameters
Effect of Data Heterogeneity
...and 14 more sections

Figures (2)

Figure 1: Results with RoBERTa-Large models on GLUE of different budget of finetuning parameters. The accuracy is computed by averaging over different ranks $\{1,2,4,8\}$. The number of clients is 3.
Figure 2: Accuracies over rounds with RoBERTa-Large models on SST-2, QNLI, MNLI, and QQP. The total number of clients is 50.

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA

TL;DR

Abstract

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA

Authors

TL;DR

Abstract

Table of Contents

Figures (2)