FedEGG: Federated Learning with Explicit Global Guidance
Kun Zhai, Yifeng Gao, Difan Zou, Guangnan Ye, Siheng Chen, Xingjun Ma, Yu-Gang Jiang
TL;DR
FedEGG introduces an explicit server-side global guiding task for federated learning, constructed from a public dataset and LLMs, and couples it with the standard FL objective in a two-phase, dual-optimization framework. The method includes a convergence-aware guiding strength controlled by a log-loss ratio constraint and a threshold $\tau$ derived from the cosine similarity between guiding and FL data representations. Theoretical analysis provides an upper bound on guiding strength and shows potential convergence acceleration when the guiding task sufficiently aids the FL task, with termination conditions tied to data heterogeneity $\Gamma_g$ and alignment measure $\Pi$. Empirically, FedEGG improves over state-of-the-art FL methods across IID and non-IID settings, particularly under high heterogeneity, and can enhance existing FL methods when used in combination, demonstrating practical impact for real-world privacy-preserving learning.
Abstract
Federated Learning (FL) holds great potential for diverse applications owing to its privacy-preserving nature. However, its convergence is often challenged by non-IID data distributions, limiting its effectiveness in real-world deployments. Existing methods help address these challenges via optimization-based client constraints, adaptive client selection, or the use of pre-trained models or synthetic data. In this work, we reinterpret these approaches as all introducing an \emph{implicit guiding task} to regularize and steer client learning. Following this insight, we propose to introduce an \emph{explicit global guiding task} into the current FL framework to improve convergence and performance. To this end, we present \textbf{FedEGG}, a new FL algorithm that constructs a global guiding task using a well-defined, easy-to-converge learning task based on a public dataset and Large Language Models (LLMs). This approach effectively combines the strengths of federated (the original FL task) and centralized (the global guiding task) learning. We provide a theoretical analysis of FedEGG's convergence, examining the impact of data heterogeneity between the guiding and FL tasks and the guiding strength. Our analysis derives an upper bound for the optimal guiding strength, offering practical insights for implementation. Empirically, FedEGG demonstrates superior performance over state-of-the-art FL methods under both IID and non-IID settings, and further improves their performances when combined.
