Federated Learning for Non-factorizable Models using Deep Generative Prior Approximations
Conor Hassan, Joshua J Bon, Elizaveta Semenova, Antonietta Mira, Kerrie Mengersen
TL;DR
This work tackles the restriction of current federated learning (FL) methods that assume conditional independence across clients by introducing Structured Independence via deep Generative Model Approximations (SIGMA) priors. SIGMA learns a hierarchical latent representation with a global latent shared across clients and local latents per client to capture dependencies while preserving a tractable structure for FL updates, enabling the use of established FL algorithms such as Structured Federated Variational Inference. The authors demonstrate SIGMA on synthetic 1D GP regression and a real-world-inspired spatial CAR-prior problem over Australia, showing that SIGMA can approximate complex priors and provide posterior inferences close to non-FL benchmarks, with an auxiliary-variable extension improving calibration and reducing overfitting. The approach broadens the applicability of FL to domains where dependencies across geographically or otherwise linked clients are essential, and suggests future work including more expressive generative models and uncertainty-aware variants.
Abstract
Federated learning (FL) allows for collaborative model training across decentralized clients while preserving privacy by avoiding data sharing. However, current FL methods assume conditional independence between client models, limiting the use of priors that capture dependence, such as Gaussian processes (GPs). We introduce the Structured Independence via deep Generative Model Approximation (SIGMA) prior which enables FL for non-factorizable models across clients, expanding the applicability of FL to fields such as spatial statistics, epidemiology, environmental science, and other domains where modeling dependencies is crucial. The SIGMA prior is a pre-trained deep generative model that approximates the desired prior and induces a specified conditional independence structure in the latent variables, creating an approximate model suitable for FL settings. We demonstrate the SIGMA prior's effectiveness on synthetic data and showcase its utility in a real-world example of FL for spatial data, using a conditional autoregressive prior to model spatial dependence across Australia. Our work enables new FL applications in domains where modeling dependent data is essential for accurate predictions and decision-making.
