FedEAT: A Robustness Optimization Framework for Federated LLMs
Yahao Pang, Xingyuan Wu, Xiaojin Zhang, Wei Chen, Hai Jin
TL;DR
Federated LLMs confront robustness challenges from data heterogeneity and adversarial clients in privacy-preserving settings. FedEAT addresses this by integrating embedding-space adversarial training with a robust aggregation rule based on the geometric median, offering resilience against both heterogeneous data and malicious updates. Empirical results show FedEAT improves adversarial robustness with only minor utility loss across multiple architectures and tasks, demonstrating practical viability. This work provides a scalable, privacy-preserving approach for deploying robust Federated LLMs in sensitive domains by combining embedding-space defenses with robust aggregation.
Abstract
Significant advancements have been made by Large Language Models (LLMs) in the domains of natural language understanding and automated content creation. However, they still face persistent problems, including substantial computational costs and inadequate availability of training data. The combination of Federated Learning (FL) and LLMs (federated LLMs) offers a solution by leveraging distributed data while protecting privacy, which positions it as an ideal choice for sensitive domains. However, Federated LLMs still suffer from robustness challenges, including data heterogeneity, malicious clients, and adversarial attacks, which greatly hinder their applications. We first introduce the robustness problems in federated LLMs, to address these challenges, we propose FedEAT (Federated Embedding space Adversarial Training), a novel framework that applies adversarial training in the embedding space of client LLM and employs a robust aggregation approach, specifically geometric median aggregation, to enhance the robustness of Federated LLMs. Our experiments demonstrate that FedEAT effectively improves the robustness of Federated LLMs with minimal performance loss.
