Table of Contents
Fetching ...

Drift-Aware Federated Learning: A Causal Perspective

Yunjie Fang, Sheng Wu, Tao Yang, Xiaofeng Wu, Bo Hu

TL;DR

This work addresses feature drift in federated learning arising from data non-iid and participation imbalance. It introduces CAFE, a drift-aware method rooted in causal inference, featuring invariant feature calibration, parameter calibration, and history-aware aggregation to decouple confounding effects from classifier decisions. The authors provide a formal causal graph, interventions, and an inference-time deconfounding strategy, backed by theoretical analysis and extensive experiments on CIFAR-10/100 and Fashion-MNIST showing improved accuracy and robustness over strong baselines. The approach demonstrates practical impact for privacy-preserving FL in heterogeneous environments by reducing drift towards majority classes and frequently participating clients while maintaining strong performance under extreme data and device heterogeneity.

Abstract

Federated learning (FL) facilitates collaborative model training among multiple clients while preserving data privacy, often resulting in enhanced performance compared to models trained by individual clients. However, factors such as communication frequency and data distribution can contribute to feature drift, hindering the attainment of optimal training performance. This paper examine the relationship between model update drift and global as well as local optimizer from causal perspective. The influence of the global optimizer on feature drift primarily arises from the participation frequency of certain clients in server updates, whereas the effect of the local optimizer is typically associated with imbalanced data distributions.To mitigate this drift, we propose a novel framework termed Causal drift-Aware Federated lEarning (CAFE). CAFE exploits the causal relationship between feature-invariant components and classification outcomes to independently calibrate local client sample features and classifiers during the training phase. In the inference phase, it eliminated the drifts in the global model that favor frequently communicating clients.Experimental results demonstrate that CAFE's integration of feature calibration, parameter calibration, and historical information effectively reduces both drift towards majority classes and tendencies toward frequently communicating nodes.

Drift-Aware Federated Learning: A Causal Perspective

TL;DR

This work addresses feature drift in federated learning arising from data non-iid and participation imbalance. It introduces CAFE, a drift-aware method rooted in causal inference, featuring invariant feature calibration, parameter calibration, and history-aware aggregation to decouple confounding effects from classifier decisions. The authors provide a formal causal graph, interventions, and an inference-time deconfounding strategy, backed by theoretical analysis and extensive experiments on CIFAR-10/100 and Fashion-MNIST showing improved accuracy and robustness over strong baselines. The approach demonstrates practical impact for privacy-preserving FL in heterogeneous environments by reducing drift towards majority classes and frequently participating clients while maintaining strong performance under extreme data and device heterogeneity.

Abstract

Federated learning (FL) facilitates collaborative model training among multiple clients while preserving data privacy, often resulting in enhanced performance compared to models trained by individual clients. However, factors such as communication frequency and data distribution can contribute to feature drift, hindering the attainment of optimal training performance. This paper examine the relationship between model update drift and global as well as local optimizer from causal perspective. The influence of the global optimizer on feature drift primarily arises from the participation frequency of certain clients in server updates, whereas the effect of the local optimizer is typically associated with imbalanced data distributions.To mitigate this drift, we propose a novel framework termed Causal drift-Aware Federated lEarning (CAFE). CAFE exploits the causal relationship between feature-invariant components and classification outcomes to independently calibrate local client sample features and classifiers during the training phase. In the inference phase, it eliminated the drifts in the global model that favor frequently communicating clients.Experimental results demonstrate that CAFE's integration of feature calibration, parameter calibration, and historical information effectively reduces both drift towards majority classes and tendencies toward frequently communicating nodes.

Paper Structure

This paper contains 29 sections, 5 theorems, 37 equations, 8 figures, 2 tables, 2 algorithms.

Key Result

Theorem 1

(General Form of Feature Update) The update of features is influenced by the data distribution of the current batch, the local optimizer, and the global model, all of which collectively amplify the drift in feature updates. The general form of feature updates in both Systems A and B is: $-\sum_{i=1}

Figures (8)

  • Figure 1: The reasons for feature drift in federated learning. The non-iid data distributions among clients cause their models to converge to different optimal points. Clients trained on class-imbalanced data exhibit low recognition accuracy for minority classes, while participation imbalance results in the global model being biased towards clients that communicate more frequently.
  • Figure 2: The causal graph of FL (a) and the process of implementing causal interventions in (b) and (c).
  • Figure 3: The detailed process of the CAFE algorithm during the training phase involves calibrating the embedding vectors and classifier parameters. Subsequently, historical information is integrated to achieve robust and drift-aware classification results for complex scenarios.
  • Figure 4: Enter Caption
  • Figure 5: Accuracy
  • ...and 3 more figures

Theorems & Definitions (9)

  • Theorem 1
  • proof
  • Theorem 2
  • proof
  • Theorem 3
  • proof
  • Lemma 1
  • proof
  • Proposition 1