FilFL: Client Filtering for Optimized Client Participation in Federated Learning
Fares Fourati, Salma Kharrat, Vaneet Aggarwal, Mohamed-Slim Alouini, Marco Canini
TL;DR
FilFL introduces a novel client-filtering layer for federated learning that selects a cooperative subset of clients via a central, non-monotone combinatorial objective solved with a greedy algorithm. The approach, implemented as χGF with deterministic and randomized variants, is followed by standard FL client selection on the filtered set, yielding faster convergence and up to about 10 percentage points higher test accuracy across vision and language tasks under time-varying client availability. The authors provide convergence guarantees showing a $\mathcal{O}(\tfrac{1}{t})$ rate up to a filtering-dependent neighborhood, and demonstrate strong empirical gains along with robustness through ablations. This technique reduces communication and computation while improving generalization, with open-source code and a companion technical report for deeper theory.
Abstract
Federated learning, an emerging machine learning paradigm, enables clients to collaboratively train a model without exchanging local data. Clients participating in the training process significantly impact the convergence rate, learning efficiency, and model generalization. We propose a novel approach, client filtering, to improve model generalization and optimize client participation and training. The proposed method periodically filters available clients to identify a subset that maximizes a combinatorial objective function with an efficient greedy filtering algorithm. Thus, the clients are assessed as a combination rather than individually. We theoretically analyze the convergence of federated learning with client filtering in heterogeneous settings and evaluate its performance across diverse vision and language tasks, including realistic scenarios with time-varying client availability. Our empirical results demonstrate several benefits of our approach, including improved learning efficiency, faster convergence, and up to 10% higher test accuracy than training without client filtering.
