LightCL: Compact Continual Learning with Low Memory Footprint For Edge Device
Zeqing Wang, Fei Cheng, Kangye Ji, Bohu Huang
TL;DR
LightCL tackles edge-device continual learning by measuring generalizability across network layers with two metrics, learning plasticity ($LP$) and memory stability ($MS$). It then maintains performance by freezing generalizable lower/middle layers (Maintain Generalizability) and memorizes past task feature patterns via a lightweight feature-map regulation (Memorize Feature Patterns). The approach achieves substantial memory-footprint reductions (up to about $6\times$) while maintaining or improving accuracy, and is validated on edge hardware. This work enables practical, on-device CL for resource-constrained environments.
Abstract
Continual learning (CL) is a technique that enables neural networks to constantly adapt to their dynamic surroundings. Despite being overlooked for a long time, this technology can considerably address the customized needs of users in edge devices. Actually, most CL methods require huge resource consumption by the training behavior to acquire generalizability among all tasks for delaying forgetting regardless of edge scenarios. Therefore, this paper proposes a compact algorithm called LightCL, which evaluates and compresses the redundancy of already generalized components in structures of the neural network. Specifically, we consider two factors of generalizability, learning plasticity and memory stability, and design metrics of both to quantitatively assess generalizability of neural networks during CL. This evaluation shows that generalizability of different layers in a neural network exhibits a significant variation. Thus, we $\textit{Maintain Generalizability}$ by freezing generalized parts without the resource-intensive training process and $\textit{Memorize Feature Patterns}$ by stabilizing feature extracting of previous tasks to enhance generalizability for less-generalized parts with a little extra memory, which is far less than the reduction by freezing. Experiments illustrate that LightCL outperforms other state-of-the-art methods and reduces at most $\textbf{6.16$\times$}$ memory footprint. We also verify the effectiveness of LightCL on the edge device.
