FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data
Shusen Jing, Anlan Yu, Shuai Zhang, Songyang Zhang
TL;DR
This paper addresses federated self-supervised learning when the global objective cannot be written as a simple sum of local objectives. It introduces FedSC, a provable FedSSL method built on the spectral contrastive objective and enabled by sharing correlation matrices among clients, complemented by differential privacy to bound additional data leakage. The authors provide convergence guarantees to a stationary point with a near $\mathcal{O}(1/\sqrt{T})$ rate and quantify the extra privacy leakage from sharing correlation matrices, showing it diminishes with larger local datasets. Empirically, FedSC achieves superior or competitive accuracy on SVHN, CIFAR-10, and CIFAR-100 under non-i.i.d. data distributions and remains robust under partial participation and DP protection. Overall, FedSC offers a theoretically grounded, communication-efficient, and privacy-aware approach to FedSSL with enhanced inter-client representation quality.
Abstract
Recent efforts have been made to integrate self-supervised learning (SSL) with the framework of federated learning (FL). One unique challenge of federated self-supervised learning (FedSSL) is that the global objective of FedSSL usually does not equal the weighted sum of local SSL objectives. Consequently, conventional approaches, such as federated averaging (FedAvg), fail to precisely minimize the FedSSL global objective, often resulting in suboptimal performance, especially when data is non-i.i.d.. To fill this gap, we propose a provable FedSSL algorithm, named FedSC, based on the spectral contrastive objective. In FedSC, clients share correlation matrices of data representations in addition to model weights periodically, which enables inter-client contrast of data samples in addition to intra-client contrast and contraction, resulting in improved quality of data representations. Differential privacy (DP) protection is deployed to control the additional privacy leakage on local datasets when correlation matrices are shared. We also provide theoretical analysis on the convergence and extra privacy leakage. The experimental results validate the effectiveness of our proposed algorithm.
