EcoLoRA: Communication-Efficient Federated Fine-Tuning of Large Language Models

Han Liu; Ruoyao Wen; Srijith Nair; Jia Liu; Wenjing Lou; Chongjie Zhang; William Yeoh; Yevgeniy Vorobeychik; Ning Zhang

EcoLoRA: Communication-Efficient Federated Fine-Tuning of Large Language Models

Han Liu, Ruoyao Wen, Srijith Nair, Jia Liu, Wenjing Lou, Chongjie Zhang, William Yeoh, Yevgeniy Vorobeychik, Ning Zhang

TL;DR

EcoLoRA tackles the communication bottleneck in federated fine-tuning of large language models by introducing a round-robin segment sharing scheme for LoRA updates, an adaptive sparsification strategy tailored to LoRA's A and B matrices, and lossless encoding via Golomb coding. The approach achieves substantial communication reductions—up to 89% in upload and up to 79% in communication time—while preserving accuracy across QA and value-alignment tasks and under realistic non-IID conditions. A convergence analysis under standard FL assumptions provides a theoretical guarantee with an $O(T^{-1/2})$ rate when using a suitably decaying learning rate. Extensive experiments on multiple models and datasets, plus ablation and practical-network studies, demonstrate EcoLoRA’s robustness, generality, and practical impact for enabling efficient, privacy-preserving FL of LLMs.

Abstract

To address data locality and privacy restrictions, Federated Learning (FL) has recently been adopted to fine-tune large language models (LLMs), enabling improved performance on various downstream tasks without requiring aggregated data. However, the repeated exchange of model updates in FL can result in prohibitively high communication costs, hindering the distributed learning process. To address this challenge, we propose EcoLoRA, a novel communication-efficient federated fine-tuning framework for LLMs. Leveraging the modular structure, we propose a round-robin segment sharing scheme, where each client uploads only a complementary LoRA segment per round to reduce network bandwidth. It is further combined with adaptive sparsification methods tailored to LoRA's training dynamics and lossless encoding techniques. We conduct extensive evaluations on both question-answering and value-alignment tasks across multiple datasets and models. The results show that EcoLoRA significantly reduces communication overhead without compromising performance. For instance, it reduces communication time by up to 79% and total training time by up to 65%.

EcoLoRA: Communication-Efficient Federated Fine-Tuning of Large Language Models

TL;DR

Abstract

EcoLoRA: Communication-Efficient Federated Fine-Tuning of Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)