Towards Client Driven Federated Learning
Songze Li, Chenqing Zhu
TL;DR
This paper tackles the challenge of asynchronous distribution shifts in federated learning by introducing Client-Driven Federated Learning (CDFL), where clients autonomously decide when to update and the server maintains multiple cluster models to tailor updates. The server performs distribution estimation using client uploads, updates cluster models with controlled updates, and sends a personalized model back to the client, all while preserving lightweight client computation and communication. The authors provide a convergence analysis under standard assumptions and validate the approach with extensive experiments on rotated image datasets and digit recognition tasks, showing improved client and cluster performance, reduced overhead, and superior distribution estimation than baselines. Overall, CDFL offers a practical, autonomous, and efficient framework for rapid adaptation to evolving data distributions in distributed settings.
Abstract
Conventional federated learning (FL) frameworks follow a server-driven model where the server determines session initiation and client participation, which faces challenges in accommodating clients' asynchronous needs for model updates. We introduce Client-Driven Federated Learning (CDFL), a novel FL framework that puts clients at the driving role. In CDFL, each client independently and asynchronously updates its model by uploading the locally trained model to the server and receiving a customized model tailored to its local task. The server maintains a repository of cluster models, iteratively refining them using received client models. Our framework accommodates complex dynamics in clients' data distributions, characterized by time-varying mixtures of cluster distributions, enabling rapid adaptation to new tasks with superior performance. In contrast to traditional clustered FL protocols that send multiple cluster models to a client to perform distribution estimation, we propose a paradigm that offloads the estimation task to the server and only sends a single model to a client, and novel strategies to improve estimation accuracy. We provide a theoretical analysis of CDFL's convergence. Extensive experiments across various datasets and system settings highlight CDFL's substantial advantages in model performance and computation efficiency over baselines.
