Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints
Yuhao Zhou, Yuxin Tian, Jindi Lv, Mingjia Shi, Yuanxi Li, Qing Ye, Shuhao Zhang, Jiancheng Lv
TL;DR
Ferret tackles real-time online continual learning under varying memory budgets by marrying fine-grained pipeline parallelism with an iterative gradient compensation scheme. It jointly optimizes model partitioning and pipeline planning through a bi-level framework, enabling memory-aware throughput while mitigating gradient staleness with a Taylor-series-based estimator and diagonal Fisher approximation. Empirical results across 20 benchmarks and 5 integrated OCL algorithms show Ferret achieving up to $3.7\times$ lower memory overhead to reach the same online accuracy and robust performance across diverse memory budgets, with the Iter-Fisher method providing automatic, data-adaptive compensation. This framework advances practical, scalable OCL in real-time environments by balancing latency, throughput, and memory usage, delivering significant gains in online adaptation without sacrificing stability.
Abstract
In the realm of high-frequency data streams, achieving real-time learning within varying memory constraints is paramount. This paper presents Ferret, a comprehensive framework designed to enhance online accuracy of Online Continual Learning (OCL) algorithms while dynamically adapting to varying memory budgets. Ferret employs a fine-grained pipeline parallelism strategy combined with an iterative gradient compensation algorithm, ensuring seamless handling of high-frequency data with minimal latency, and effectively counteracting the challenge of stale gradients in parallel training. To adapt to varying memory budgets, its automated model partitioning and pipeline planning optimizes performance regardless of memory limitations. Extensive experiments across 20 benchmarks and 5 integrated OCL algorithms show Ferret's remarkable efficiency, achieving up to 3.7$\times$ lower memory overhead to reach the same online accuracy compared to competing methods. Furthermore, Ferret consistently outperforms these methods across diverse memory budgets, underscoring its superior adaptability. These findings position Ferret as a premier solution for efficient and adaptive OCL framework in real-time environments.
