FilFL: Client Filtering for Optimized Client Participation in Federated Learning

Fares Fourati; Salma Kharrat; Vaneet Aggarwal; Mohamed-Slim Alouini; Marco Canini

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

Fares Fourati, Salma Kharrat, Vaneet Aggarwal, Mohamed-Slim Alouini, Marco Canini

TL;DR

FilFL introduces a novel client-filtering layer for federated learning that selects a cooperative subset of clients via a central, non-monotone combinatorial objective solved with a greedy algorithm. The approach, implemented as χGF with deterministic and randomized variants, is followed by standard FL client selection on the filtered set, yielding faster convergence and up to about 10 percentage points higher test accuracy across vision and language tasks under time-varying client availability. The authors provide convergence guarantees showing a $\mathcal{O}(\tfrac{1}{t})$ rate up to a filtering-dependent neighborhood, and demonstrate strong empirical gains along with robustness through ablations. This technique reduces communication and computation while improving generalization, with open-source code and a companion technical report for deeper theory.

Abstract

Federated learning, an emerging machine learning paradigm, enables clients to collaboratively train a model without exchanging local data. Clients participating in the training process significantly impact the convergence rate, learning efficiency, and model generalization. We propose a novel approach, client filtering, to improve model generalization and optimize client participation and training. The proposed method periodically filters available clients to identify a subset that maximizes a combinatorial objective function with an efficient greedy filtering algorithm. Thus, the clients are assessed as a combination rather than individually. We theoretically analyze the convergence of federated learning with client filtering in heterogeneous settings and evaluate its performance across diverse vision and language tasks, including realistic scenarios with time-varying client availability. Our empirical results demonstrate several benefits of our approach, including improved learning efficiency, faster convergence, and up to 10% higher test accuracy than training without client filtering.

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

TL;DR

rate up to a filtering-dependent neighborhood, and demonstrate strong empirical gains along with robustness through ablations. This technique reduces communication and computation while improving generalization, with open-source code and a companion technical report for deeper theory.

Abstract

Paper Structure (38 sections, 6 theorems, 42 equations, 22 figures, 3 tables, 2 algorithms)

This paper contains 38 sections, 6 theorems, 42 equations, 22 figures, 3 tables, 2 algorithms.

Problem Formulation
Training Objective
Filtering Objective
Client Filtering
Client Filtering in FL (FilFL)
Greedy Filtering ($\chi$GF)
FilFL Convergence Analysis
Assumptions and Definitions
FilFL Theoretical Convergence Results
Experiments
Setup
FilFL Outperforms Standard FL Algorithms
FilFL (FedAvg with $\chi$GF and PoC) vs FedAvg (PoC)
FilFL (FedProx with $\chi$GF and RS) vs FedProx (RS)
FilFL (FedAvg with $\chi$GF and RS) vs FedAvg (DivFL).
...and 23 more sections

Key Result

Theorem 1

Let assumptions ass1, ass2, ass3, ass4, ass5, and samplingassumptio hold, then we have for some time constant $\varphi$ that depends on the filtering.

Figures (22)

Figure 1: Visualization of two scenarios with different suggested descent directions from different clients. Arrows are color-coded to indicate the quality of direction: blue for optimal, orange for favorable outlier, green for majority consensus, and red for unfavorable outlier.
Figure 2: FilFL incorporates client filtering in FL, which is activated when the boolean condition 'Bool' becomes true, either when new clients become available or when $h$ rounds have elapsed since the last filtering call. Otherwise, the condition remains false. In both scenarios, clients are selected from the filtered-in subset of clients, denoted as $\mathcal{S}^f$.
Figure 3: FilFL (FedAvg with $\chi$GF) vs FedAvg (w/o filtering) both with PoC on Shakespeare dataset with $N = 143$, $n=100$, $K=10$, $m=34$, and $h=5$.
Figure 4: FilFL (FedProx with $\chi$GF) vs FedProx (w/o filtering) both with RS on FEMNIST dataset with $N = 190$, $n=50$, $K=5$, $m=2000$, and $h=5$.
Figure 5: FilFL (FedProx + $\chi$GF + RS) vs FedProx (RS) without filtering on Shakespeare dataset.
...and 17 more figures

Theorems & Definitions (17)

Remark 1
Remark 2
Remark 3
Remark 4
Remark 5
Theorem 1
proof
Lemma 1
proof
Lemma 2
...and 7 more

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

TL;DR

Abstract

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (22)

Theorems & Definitions (17)