Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

Xinting Liao; Weiming Liu; Chaochao Chen; Pengyang Zhou; Fengyuan Yu; Huabin Zhu; Binhui Yao; Tao Wang; Xiaolin Zheng; Yanchao Tan

Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

Xinting Liao, Weiming Liu, Chaochao Chen, Pengyang Zhou, Fengyuan Yu, Huabin Zhu, Binhui Yao, Tao Wang, Xiaolin Zheng, Yanchao Tan

TL;DR

This paper tackles federated unsupervised learning with non-IID data by addressing two core issues: representation collapse across local and global models, and misaligned representation spaces among clients. It proposes FedU2, a framework combining a Flexible Uniform Regularizer (FUR) and an Efficient Unified Aggregator (EUA) to enforce uniform and unified representations, respectively. The method uses unbalanced optimal transport to push local representations toward a spherical Gaussian and a multi-objective ADMM-based server aggregation to balance model updates across clients. Experiments on CIFAR10 and CIFAR100 show that FedU2 outperforms baselines in cross-device and cross-silo settings, with ablations confirming the contributions of both FUR and EUA and visualization analyses illustrating improved representation coherence.

Abstract

Federated learning achieves effective performance in modeling decentralized data. In practice, client data are not well-labeled, which makes it potential for federated unsupervised learning (FUSL) with non-IID data. However, the performance of existing FUSL methods suffers from insufficient representations, i.e., (1) representation collapse entanglement among local and global models, and (2) inconsistent representation spaces among local models. The former indicates that representation collapse in local model will subsequently impact the global model and other local models. The latter means that clients model data representation with inconsistent parameters due to the deficiency of supervision signals. In this work, we propose FedU2 which enhances generating uniform and unified representation in FUSL with non-IID data. Specifically, FedU2 consists of flexible uniform regularizer (FUR) and efficient unified aggregator (EUA). FUR in each client avoids representation collapse via dispersing samples uniformly, and EUA in server promotes unified representation by constraining consistent client model updating. To extensively validate the performance of FedU2, we conduct both cross-device and cross-silo evaluation experiments on two benchmark datasets, i.e., CIFAR10 and CIFAR100.

Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

TL;DR

Abstract

Paper Structure (22 sections, 6 theorems, 32 equations, 11 figures, 2 tables, 1 algorithm)

This paper contains 22 sections, 6 theorems, 32 equations, 11 figures, 2 tables, 1 algorithm.

Introduction
Related Work
Federated Unsupervised Learning
Representation Collapse
Method
Federated Unsupervised Learning Formulation
Fed$\text{U}^2$ Overview
FUR for Mitigating Representation Collapse
EUA for Generalizing Unified Representation
Overall Algorithm and Convergence Analysis
Experiments
Experimental Setups
Experimental Results
Representation Visualization
Conclusion
...and 7 more sections

Key Result

Theorem 1

Rethinking the Lagrangian of dual form in Eq. eq:mtl_cc_dual, it holds $\nabla \log( {u}_i\left(\boldsymbol{\theta}^{t}_g\right)) =\nabla \log( {u}_j\left(\boldsymbol{\theta}^{t}_g\right))$, $\forall i \neq j\in [K]$.

Figures (11)

Figure 1: Framework of Fed$\text{U}^2$. For clients with agnostic self-supervised framework, FUR expands non-IID data uniformly to avoid representation collapse for FUSL. EUA in server maintains a balanced aggregation for all client models, bringing unified representations.
Figure 2: Example of FUR. Firstly, data representations collapse in part of the spherical space. Then FUR flexibly maps data towards spherical Gaussian distribution with unbalanced optimal transport (UOT), dispersing data uniformly.
Figure 3: Top k log singular values of the covariance matrix of global model (left) and local model (right) representations.
Figure 4: The representations collapse issue on the sphere using BYOL model (on CIFAR10 $\alpha=0.1$ Cross-silo). The more blank representation space means the more severe collapse issue is.
Figure 5: The distributions of data representations using global and local BYOL model (on CIFAR10 $\alpha=0.1$ Cross-silo).
...and 6 more figures

Theorems & Definitions (13)

Theorem 1: Optimization consistency of model deviations
proof
Lemma 1: Bound of Client Model Divergence
proof
Theorem 2: Convergence Error Bound
proof
Theorem 3: Optimization consistency of model deviations
proof
Lemma 2: Bound of Client Model Divergence
proof
...and 3 more

Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

TL;DR

Abstract

Rethinking the Representation in Federated Unsupervised Learning with Non-IID Data

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (13)