Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning
Jinglin Liang, Jin Zhong, Hanlin Gu, Zhongqi Lu, Xingxing Tang, Gang Dai, Shuangping Huang, Lixin Fan, Qiang Yang
TL;DR
Federated Class Continual Learning (FCCL) suffers catastrophic forgetting under privacy constraints that limit experience replay. The authors introduce Diffusion-Driven Data Replay (DDDR), a two-phase framework combining Federated Class Inversion (FCI) with a frozen pre-trained Latent Diffusion Model to learn compact class embeddings and generate high-quality replay data, followed by Replay-Augmented Training that fuses generated and real data with contrastive learning and knowledge distillation. By transmitting only class embeddings and leveraging FedAvg for aggregation, DDDR achieves state-of-the-art performance on CIFAR-100 and Tiny-ImageNet across IID and non-IID settings, while reducing data leakage risk and computational overhead. The approach also shows robust generalization to real and generated data domains, suggesting practical viability for privacy-preserving FCCL deployments.
Abstract
Federated Class Continual Learning (FCCL) merges the challenges of distributed client learning with the need for seamless adaptation to new classes without forgetting old ones. The key challenge in FCCL is catastrophic forgetting, an issue that has been explored to some extent in Continual Learning (CL). However, due to privacy preservation requirements, some conventional methods, such as experience replay, are not directly applicable to FCCL. Existing FCCL methods mitigate forgetting by generating historical data through federated training of GANs or data-free knowledge distillation. However, these approaches often suffer from unstable training of generators or low-quality generated data, limiting their guidance for the model. To address this challenge, we propose a novel method of data replay based on diffusion models. Instead of training a diffusion model, we employ a pre-trained conditional diffusion model to reverse-engineer each class, searching the corresponding input conditions for each class within the model's input space, significantly reducing computational resources and time consumption while ensuring effective generation. Furthermore, we enhance the classifier's domain generalization ability on generated and real data through contrastive learning, indirectly improving the representational capability of generated data for real data. Comprehensive experiments demonstrate that our method significantly outperforms existing baselines. Code is available at https://github.com/jinglin-liang/DDDR.
