Parameterizing Federated Continual Learning for Reproducible Research
Bart Cox, Jeroen Galjaard, Aditya Shankar, Jérémie Decouchant, Lydia Y. Chen
TL;DR
Frederick tackles reproducible Federated Continual Learning in heterogeneous and evolving environments by introducing Freddie, a Kubernetes-based, open-source framework that supports data and task heterogeneity, Task-IL and Domain-IL, and scalable experiments via Kubeflow training operators. It identifies essential requirements for FCL emulation (usability, reproducibility, complex workloads, resource heterogeneity) and provides novel training orchestration and extraction components along with three task-partition schemes (Column, Balanced, Shuffled). The paper demonstrates Freddie on CIFAR100 with large-scale FL and heterogeneous task sequences, revealing significant forgetting under certain workloads and highlighting practical considerations like co-scheduling effects and runtime variability. Overall, Freddie enables reproducible, scalable FCL research and serves as a foundation for evaluating, benchmarking, and extending Federated Continual Learning in realistic environments.
Abstract
Federated Learning (FL) systems evolve in heterogeneous and ever-evolving environments that challenge their performance. Under real deployments, the learning tasks of clients can also evolve with time, which calls for the integration of methodologies such as Continual Learning. To enable research reproducibility, we propose a set of experimental best practices that precisely capture and emulate complex learning scenarios. Our framework, Freddie, is the first entirely configurable framework for Federated Continual Learning (FCL), and it can be seamlessly deployed on a large number of machines thanks to the use of Kubernetes and containerization. We demonstrate the effectiveness of Freddie on two use cases, (i) large-scale FL on CIFAR100 and (ii) heterogeneous task sequence on FCL, which highlight unaddressed performance challenges in FCL scenarios.
