FedMeS: Personalized Federated Continual Learning Leveraging Local Memory
Jin Xie, Chenqing Zhu, Songze Li
TL;DR
FedMeS tackles Personalized Federated Continual Learning under non-IID task streams by equipping each client with a small local memory used both to calibrate gradient updates during training and to enable KNN-based Gaussian inference during testing. The framework blends local memory-driven gradient corrections with a dynamic regularization that links local and global knowledge, and employs a task-oblivious inference pipeline that combines a personalized model with memory-based predictions. Theoretical convergence results under standard SGD assumptions show the memory-guided updates drive local losses toward optima, while experiments across diverse PFCL benchmarks demonstrate consistent improvements in average accuracy and forgetting rates compared to strong baselines, including FedWeIT. FedMeS also demonstrates memory efficiency and favorable computation-communication trade-offs, indicating practical suitability for scalable PFCL systems.
Abstract
We focus on the problem of Personalized Federated Continual Learning (PFCL): a group of distributed clients, each with a sequence of local tasks on arbitrary data distributions, collaborate through a central server to train a personalized model at each client, with the model expected to achieve good performance on all local tasks. We propose a novel PFCL framework called Federated Memory Strengthening FedMeS to address the challenges of client drift and catastrophic forgetting. In FedMeS, each client stores samples from previous tasks using a small amount of local memory, and leverages this information to both 1) calibrate gradient updates in training process; and 2) perform KNN-based Gaussian inference to facilitate personalization. FedMeS is designed to be task-oblivious, such that the same inference process is applied to samples from all tasks to achieve good performance. FedMeS is analyzed theoretically and evaluated experimentally. It is shown to outperform all baselines in average accuracy and forgetting rate, over various combinations of datasets, task distributions, and client numbers.
