Variational Bayes for Federated Continual Learning
Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun
TL;DR
This work addresses Federated Continual Learning under non-stationary, privacy-constrained data by introducing FedBNN, a Federated Bayesian Neural Network trained via variational inference. It integrates historical and local distributions through history-aware local inference, local likelihood extraction, and global aggregation to form a coherent global posterior without requiring explicit task boundaries. A Prototype Library handles dynamic label spaces, while SNN-based initialization mitigates prior-driven forgetting during distribution drift. Experimental results across class- and task-incremental and gradual FCL settings show FedBNN achieves state-of-the-art forgetting mitigation and competitive adaptation, with uncertainty estimates enabling safer predictions in real-world deployments.
Abstract
Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes over time. The constraints of storage limitations and privacy concerns confine local models to exclusively access the present data within each learning cycle. Consequently, this restriction induces performance degradation in model training on previous data, termed "catastrophic forgetting". However, existing FCL approaches need to identify or know changes in data distribution, which is difficult in the real world. To release these limitations, this paper directs attention to a broader continuous framework. Within this framework, we introduce Federated Bayesian Neural Network (FedBNN), a versatile and efficacious framework employing a variational Bayesian neural network across all clients. Our method continually integrates knowledge from local and historical data distributions into a single model, adeptly learning from new data distributions while retaining performance on historical distributions. We rigorously evaluate FedBNN's performance against prevalent methods in federated learning and continual learning using various metrics. Experimental analyses across diverse datasets demonstrate that FedBNN achieves state-of-the-art results in mitigating forgetting.
