Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation
Ting Wei, Biao Mei, Junliang Lyu, Renquan Zhang, Feng Zhou, Yifan Sun
TL;DR
This work tackles personalized Bayesian federated learning under non-i.i.d. client data by marrying nonparametric local posterior inference with geometry-aware global aggregation. It introduces FedWBA, which uses particle-based variational inference (SVGD) to represent local posteriors and a particle-based Wasserstein barycenter to aggregate these posteriors into a global prior, with kernel density estimation ensuring a continuous prior for SVGD updates. The authors prove local ELBO growth and global barycenter consistency, and empirically show improved prediction accuracy, uncertainty calibration (lower ECE), and faster convergence across multiple datasets and client regimes. The approach offers a principled, uncertainty-aware alternative to Gaussian-posteriors and simple averaging, with clear implications for safety-critical, privacy-preserving distributed learning.
Abstract
Personalized Bayesian federated learning (PBFL) handles non-i.i.d. client data and quantifies uncertainty by combining personalization with Bayesian inference. However, existing PBFL methods face two limitations: restrictive parametric assumptions in client posterior inference and naive parameter averaging for server aggregation. To overcome these issues, we propose FedWBA, a novel PBFL method that enhances both local inference and global aggregation. At the client level, we use particle-based variational inference for nonparametric posterior representation. At the server level, we introduce particle-based Wasserstein barycenter aggregation, offering a more geometrically meaningful approach. Theoretically, we provide local and global convergence guarantees for FedWBA. Locally, we prove a KL divergence decrease lower bound per iteration for variational inference convergence. Globally, we show that the Wasserstein barycenter converges to the true parameter as the client data size increases. Empirically, experiments show that FedWBA outperforms baselines in prediction accuracy, uncertainty calibration, and convergence rate, with ablation studies confirming its robustness.
