REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning
Sungho Jeon, Xinyue Ma, Kwang In Kim, Myeongjae Jeon
TL;DR
Continual learning on edge devices is hindered by the resource demands of prompt-based rehearsal-free methods. REP mitigates this by using a lightweight surrogate for prompt selection and two adaptive update techniques—adaptive token merging and adaptive layer dropping—to reduce compute and memory while preserving task-specific knowledge. The approach yields substantial reductions in training time and memory across multiple datasets and ViT backbones, with only marginal accuracy loss, and extends to non-prompting methods and adapters. Overall, REP enables practical, on-device continual learning with vision transformers by delivering substantial efficiency gains without compromising core performance.
Abstract
Recent rehearsal-free continual learning (CL) methods guided by prompts achieve strong performance on vision tasks with non-stationary data but remain resource-intensive, hindering real-world edge deployment. We introduce resource-efficient prompting (REP), which improves the computational and memory efficiency of prompt-based rehearsal-free continual learning methods while minimizing accuracy trade-offs. Our approach employs swift prompt selection to refine input data using a carefully provisioned model and introduces adaptive token merging (AToM) and adaptive layer dropping (ALD) for efficient prompt updates. AToM and ALD selectively skip data and model layers while preserving task-specific features during the learning of new tasks. Extensive experiments on multiple image classification datasets demonstrate REP's superior resource efficiency over state-of-the-art rehearsal-free CL methods.
