Table of Contents
Fetching ...

Recent Advances in Federated Learning Driven Large Language Models: A Survey on Architecture, Performance, and Security

Youyang Qu, Ming Liu, Tianqing Zhu, Longxiang Gao, Shui Yu, Wanlei Zhou

TL;DR

The paper surveys Federated Learning (FL) approaches for training Large Language Models (LLMs) across decentralized data sources to preserve privacy and reduce communication. It surveys architectures, performance optimizations, and security considerations, including machine unlearning as a mechanism to comply with regulations. It reviews methods such as secure aggregation, differential privacy, prompt tuning, hierarchical aggregation, and model splitting, and discusses their trade-offs and practical implications via case studies. The work identifies key open challenges and directions for building secure, scalable, and adaptable FL-LMM systems with real-world impact in sensitive domains.

Abstract

Federated Learning (FL) offers a promising paradigm for training Large Language Models (LLMs) in a decentralized manner while preserving data privacy and minimizing communication overhead. This survey examines recent advancements in FL-driven LLMs, with a particular emphasis on architectural designs, performance optimization, and security concerns, including the emerging area of machine unlearning. In this context, machine unlearning refers to the systematic removal of specific data contributions from trained models to comply with privacy regulations such as the Right to be Forgotten. We review a range of strategies enabling unlearning in federated LLMs, including perturbation-based methods, model decomposition, and incremental retraining, while evaluating their trade-offs in terms of efficiency, privacy guarantees, and model utility. Through selected case studies and empirical evaluations, we analyze how these methods perform in practical FL scenarios. This survey identifies critical research directions toward developing secure, adaptable, and high-performing federated LLM systems for real-world deployment.

Recent Advances in Federated Learning Driven Large Language Models: A Survey on Architecture, Performance, and Security

TL;DR

The paper surveys Federated Learning (FL) approaches for training Large Language Models (LLMs) across decentralized data sources to preserve privacy and reduce communication. It surveys architectures, performance optimizations, and security considerations, including machine unlearning as a mechanism to comply with regulations. It reviews methods such as secure aggregation, differential privacy, prompt tuning, hierarchical aggregation, and model splitting, and discusses their trade-offs and practical implications via case studies. The work identifies key open challenges and directions for building secure, scalable, and adaptable FL-LMM systems with real-world impact in sensitive domains.

Abstract

Federated Learning (FL) offers a promising paradigm for training Large Language Models (LLMs) in a decentralized manner while preserving data privacy and minimizing communication overhead. This survey examines recent advancements in FL-driven LLMs, with a particular emphasis on architectural designs, performance optimization, and security concerns, including the emerging area of machine unlearning. In this context, machine unlearning refers to the systematic removal of specific data contributions from trained models to comply with privacy regulations such as the Right to be Forgotten. We review a range of strategies enabling unlearning in federated LLMs, including perturbation-based methods, model decomposition, and incremental retraining, while evaluating their trade-offs in terms of efficiency, privacy guarantees, and model utility. Through selected case studies and empirical evaluations, we analyze how these methods perform in practical FL scenarios. This survey identifies critical research directions toward developing secure, adaptable, and high-performing federated LLM systems for real-world deployment.
Paper Structure (18 sections, 6 figures, 5 tables)