Balancing Similarity and Complementarity for Federated Learning

Kunda Yan; Sen Cui; Abudukelimu Wuerkaixi; Jingfeng Zhang; Bo Han; Gang Niu; Masashi Sugiyama; Changshui Zhang

Balancing Similarity and Complementarity for Federated Learning

Kunda Yan, Sen Cui, Abudukelimu Wuerkaixi, Jingfeng Zhang, Bo Han, Gang Niu, Masashi Sugiyama, Changshui Zhang

TL;DR

The paper addresses non-i.i.d. data challenges in Federated Learning by arguing that optimal cooperation is not achieved by pursuing maximum model similarity alone. It introduces FedSaC, a two-stage framework that learns a cooperation network by jointly optimizing a weighted mix of similarity and feature complementarity, the latter quantified via principal angles between local data subspaces obtained from SVD. The method uses a server-side optimization to derive the adjacency matrix and broadcasts aggregated models, followed by client-side refinement that respects the server-derived cooperation while fitting local data. Empirical results on unimodal (CIFAR-10/100) and multimodal (CUB200-2011) benchmarks show that FedSaC consistently surpasses state-of-the-art FL methods across various heterogeneity regimes, validating the importance of exploiting data complementarity in cooperative learning.

Abstract

In mobile and IoT systems, Federated Learning (FL) is increasingly important for effectively using data while maintaining user privacy. One key challenge in FL is managing statistical heterogeneity, such as non-i.i.d. data, arising from numerous clients and diverse data sources. This requires strategic cooperation, often with clients having similar characteristics. However, we are interested in a fundamental question: does achieving optimal cooperation necessarily entail cooperating with the most similar clients? Typically, significant model performance improvements are often realized not by partnering with the most similar models, but through leveraging complementary data. Our theoretical and empirical analyses suggest that optimal cooperation is achieved by enhancing complementarity in feature distribution while restricting the disparity in the correlation between features and targets. Accordingly, we introduce a novel framework, \texttt{FedSaC}, which balances similarity and complementarity in FL cooperation. Our framework aims to approximate an optimal cooperation network for each client by optimizing a weighted sum of model similarity and feature complementarity. The strength of \texttt{FedSaC} lies in its adaptability to various levels of data heterogeneity and multimodal scenarios. Our comprehensive unimodal and multimodal experiments demonstrate that \texttt{FedSaC} markedly surpasses other state-of-the-art FL methods.

Balancing Similarity and Complementarity for Federated Learning

TL;DR

Abstract

Paper Structure (40 sections, 14 equations, 7 figures, 6 tables, 1 algorithm)

This paper contains 40 sections, 14 equations, 7 figures, 6 tables, 1 algorithm.

Introduction
Related Work
Federated Learning and Statistical Heterogeneity
Personalized Federated Learning
Federated Multimodal Learning
Problem Setup
Notations
Statistical Heterogeneity
Optimal Cooperation
Our Method: Balancing Similarity and Complementarity
Cooperation Network
Optimization with Similarity and Complementarity
FedSaC: Balancing Similarity and Complementarity
The Metric of Similarity and Complementarity
FedSaC in FL architecture
...and 25 more sections

Figures (7)

Figure 1: Illustration of the role of data complementarity on personalized federated learning cooperation. The figure presents the experimental results how increasing data complementarity between two clients influences average accuracy post-cooperation and model similarity. Three scenarios are presented, showcasing distinct levels of complementarity for local data distributions. The findings underscore the benefits of complementarity, revealing that a balance of similarity and complementarity enhances cooperative benefits in a federated learning framework.
Figure 2: Illustration of our FedSaC approach. Local clients train models by minimizing empirical risk, incorporating a regularization term based on the distance to the aggregated model. Post-training, models are distilled via SVD to capture the representative subspace, which, alongside model parameters, is sent to the server. The server constructs a cooperation network, balancing similarity and complementarity among clients, to aggregate models. These aggregated models are then disseminated to clients for the subsequent training iteration.
Figure 3: Visualization of FedSaC: local data, process matrices, and cooperation networks under three collaboration states
Figure 4: Illustration of the level of heterogeneity under four distinct partitioning schemes.
Figure 5: Average accuracy curves of the four partitions under various hyperparameter $\alpha$ settings.
...and 2 more figures

Theorems & Definitions (1)

Definition 3.1

Balancing Similarity and Complementarity for Federated Learning

TL;DR

Abstract

Balancing Similarity and Complementarity for Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (7)

Theorems & Definitions (1)