Calibre: Towards Fair and Accurate Personalized Federated Learning with Self-Supervised Learning
Sijia Chen, Ningxin Su, Baochun Li
TL;DR
Calibre tackles fair and accurate personalized federated learning in non-i.i.d. settings by calibrating self-supervised representations through client-specific prototypes and a prototype-guided aggregation mechanism. It formalizes the Generality-Personalization Tradeoff with a theorem that balances information flows via a constrained objective, implemented as a four-term loss $L^c = l_c + l_s + \alpha(l_p + l_n)$ augmented by prototype generation and client-adaptive regularizers. Empirically, Calibre achieves state-of-the-art mean accuracy and reduced fairness variance across CIFAR-10, CIFAR-100, and STL-10, and extends effectively to novel unseen clients, outperforming label-dependent and other SSL-based pFL baselines. The approach provides a practical, SSL-based framework for reliable personalization in FL by preserving fairness while leveraging unlabeled data and lightweight per-client heads, with potential impact on privacy-preserving collaborative learning in heterogeneous environments.
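To make the role of $\alpha$ in the loss above concrete, here is a minimal sketch of how the four scalar terms could be combined. The function name `combine_loss_terms` and the default value of `alpha` are hypothetical and used only for illustration. How each individual term is computed is not specified here and is not attempted.

```python
def combine_loss_terms(l_c, l_s, l_p, l_n, alpha=0.5):
    """Combine four scalar loss terms as L^c = l_c + l_s + alpha * (l_p + l_n).

    The name and the default alpha are illustrative assumptions; only the
    algebraic form of the sum comes from the text above.
    """
    if alpha < 0:
        raise ValueError("alpha must be non-negative")
    return l_c + l_s + alpha * (l_p + l_n)


# With alpha = 0 the two alpha-weighted terms drop out entirely.
baseline = combine_loss_terms(1.0, 2.0, 3.0, 4.0, alpha=0.0)
weighted = combine_loss_terms(1.0, 2.0, 3.0, 4.0, alpha=0.5)
```

Setting `alpha` to zero recovers `l_c + l_s` alone, so under this form $\alpha$ acts as a single knob for how strongly the last two terms influence training.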
Abstract
In the context of personalized federated learning, existing approaches train a global model to extract transferable representations, from which any client can train a personalized model with a limited number of data samples. Self-supervised learning (SSL) is considered a promising direction, as the global model it produces is generic and facilitates personalization for all clients fairly. However, when data is heterogeneous across clients, a global model trained using SSL does not lead to high-quality personalized models. In this paper, we show that when the global model is trained with SSL without modifications, the representations it produces have fuzzy class boundaries. As a result, personalized learning within each client produces models with low accuracy. To improve the accuracy of SSL without sacrificing its advantage in fairness, we propose Calibre, a new personalized federated learning framework designed to calibrate SSL representations by maintaining a suitable balance between more generic and more client-specific representations. Calibre is designed based on theoretically sound properties, and introduces (1) a client-specific prototype loss as an auxiliary training objective; and (2) an aggregation algorithm guided by such prototypes across clients. Our experimental results in an extensive array of non-i.i.d. settings show that Calibre achieves state-of-the-art performance in terms of both mean accuracy and fairness across clients. Code repo: https://github.com/TL-System/plato/tree/main/examples/ssl/calibre
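The two ingredients named in the abstract, client-specific prototypes and prototype-guided aggregation, can be illustrated with a small standard-library sketch. It is not the authors' algorithm: it assumes that cluster assignments for each embedding are already available, that a prototype is the mean embedding of a cluster, and that clients whose embeddings sit closer to their prototypes receive larger aggregation weights through a softmax over negative divergences. All function names are hypothetical.

```python
import math


def compute_prototypes(embeddings, assignments):
    """Return the mean embedding per cluster id (assumed prototype definition)."""
    sums, counts = {}, {}
    for vec, k in zip(embeddings, assignments):
        acc = sums.setdefault(k, [0.0] * len(vec))
        for i, v in enumerate(vec):
            acc[i] += v
        counts[k] = counts.get(k, 0) + 1
    return {k: [v / counts[k] for v in acc] for k, acc in sums.items()}


def mean_divergence(embeddings, assignments, protos):
    """Mean Euclidean distance from each embedding to its assigned prototype."""
    total = sum(math.dist(vec, protos[k]) for vec, k in zip(embeddings, assignments))
    return total / len(embeddings)


def aggregation_weights(divergences):
    """Softmax over negative divergences (an assumed weighting rule)."""
    lowest = min(divergences)  # shift for numerical stability
    scores = [math.exp(-(d - lowest)) for d in divergences]
    total = sum(scores)
    return [s / total for s in scores]


def aggregate(client_params, weights):
    """Weighted average of flat per-client parameter lists."""
    size = len(client_params[0])
    return [sum(w * p[i] for w, p in zip(weights, client_params)) for i in range(size)]


# Toy example: one client's embeddings with hypothetical cluster assignments.
embeddings = [[0.0, 0.0], [2.0, 0.0], [10.0, 10.0]]
assignments = [0, 0, 1]
protos = compute_prototypes(embeddings, assignments)
div = mean_divergence(embeddings, assignments, protos)

# Two clients with different divergences, then a weighted parameter average.
weights = aggregation_weights([0.5, 2.0])
merged = aggregate([[1.0, 2.0], [3.0, 4.0]], [0.5, 0.5])
```

In this toy setup the client with divergence 0.5 gets a larger weight than the one with divergence 2.0. Whether tighter clusters should be rewarded or penalized is a design choice of the sketch, and the implementation in the linked repository may differ.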
