Active Continual Learning: On Balancing Knowledge Retention and Learnability
Thuy-Trang Vu, Shahram Khadivi, Mahsa Ghorbanali, Dinh Phung, Gholamreza Haffari
TL;DR
This work formalizes active continual learning (ACL), studying how selective annotation within an annotation budget interacts with continual learning (CL) across domain-, class-, and task-incremental settings. It introduces key metrics, notably forgetting rate $FR$ and learning-curve area $LCA$, and proposes the forgetting-learning profile to diagnose when ACL methods trade rapid learning for memory retention. Through extensive experiments on image and text classification, the authors show that ACL with partial labeling can match or approach full-data CL in domain-IL, with experience replay (ER) often providing robust retention, while class-IL remains challenging for ideal ACL placements. The findings offer practical guidelines for selecting AL and CL algorithms in ACL and highlight the need for strategies that balance old-task retention with fast acquisition of new knowledge, especially in non-domain IL scenarios.
Abstract
Acquiring new knowledge without forgetting what has been learned in a sequence of tasks is the central focus of continual learning (CL). While tasks arrive sequentially, the training data are often prepared and annotated independently, leading to the CL of incoming supervised learning tasks. This paper considers the under-explored problem of active continual learning (ACL) for a sequence of active learning (AL) tasks, where each incoming task includes a pool of unlabelled data and an annotation budget. We investigate the effectiveness and interplay between several AL and CL algorithms in the domain, class and task-incremental scenarios. Our experiments reveal the trade-off between two contrasting goals of not forgetting the old knowledge and the ability to quickly learn new knowledge in CL and AL, respectively. While conditioning the AL query strategy on the annotations collected for the previous tasks leads to improved task performance on the domain and task incremental learning, our proposed forgetting-learning profile suggests a gap in balancing the effect of AL and CL for the class-incremental scenario.
