PaDPaF: Partial Disentanglement with Partially-Federated GANs
Abdulla Jasem Almansoori, Samuel Horváth, Martin Takáč
TL;DR
PaDPaF tackles learning personalized generative models under data heterogeneity by partially federating GAN components to separate content (global) from style (local). It introduces a partial-federation architecture with a global content encoder/discriminator and client-specific style modules, trained via FedAvg-style updates, and a latent-contrastive regularizer inspired by Barlow Twins to enforce content–style disentanglement, ensuring a robust, client-invariant content representation. The framework enables high-accuracy content classification with a simple linear head and the generation of locally unseen labels, while supporting data anonymization by sharing only content. Empirical evaluations on MNIST, CIFAR-10, and CelebA show that content features carry the majority of label information and that PaDPaF outperforms baselines in both generation and style transfer tasks, with potential extensions to VAEs and broader modalities.
Abstract
Federated learning has become a popular machine learning paradigm with many potential real-life applications, including recommendation systems, the Internet of Things (IoT), healthcare, and self-driving cars. Though most current applications focus on classification-based tasks, learning personalized generative models remains largely unexplored, and their benefits in the heterogeneous setting still need to be better understood. This work proposes a novel architecture combining global client-agnostic and local client-specific generative models. We show that using standard techniques for training federated models, our proposed model achieves privacy and personalization by implicitly disentangling the globally consistent representation (i.e. content) from the client-dependent variations (i.e. style). Using such decomposition, personalized models can generate locally unseen labels while preserving the given style of the client and can predict the labels for all clients with high accuracy by training a simple linear classifier on the global content features. Furthermore, disentanglement enables other essential applications, such as data anonymization, by sharing only the content. Extensive experimental evaluation corroborates our findings, and we also discuss a theoretical motivation for the proposed approach.
