Task-conditioned Ensemble of Expert Models for Continuous Learning
Renu Sharma, Debasmita Pal, Arun Ross
TL;DR
This work tackles continual learning under distribution shifts by introducing a task-conditioned ensemble of expert models augmented with an in-domain model that estimates task membership. The in-domain component uses a Vision Transformer feature extractor trained with center and mean-shifted intra-class losses, plus a LOF-based distance measure to generate membership scores, enabling a dynamic fusion of task-specific experts via $s = s_1 m_1 + s_2 m_2$. Across LivDet iris datasets and Split MNIST, the approach delivers strong retention of old-task performance and competitive accuracy under various shifts while reducing memory demands compared with replay-based methods. The results highlight effective membership allocation, robust representations, and scalable paths through distillation or model merging to handle growing task pools.
Abstract
One of the major challenges in machine learning is maintaining the accuracy of the deployed model (e.g., a classifier) in a non-stationary environment. The non-stationary environment results in distribution shifts and, consequently, a degradation in accuracy. Continuous learning of the deployed model with new data could be one remedy. However, the question arises as to how we should update the model with new training data so that it retains its accuracy on the old data while adapting to the new data. In this work, we propose a task-conditioned ensemble of models to maintain the performance of the existing model. The method involves an ensemble of expert models based on task membership information. The in-domain models-based on the local outlier concept (different from the expert models) provide task membership information dynamically at run-time to each probe sample. To evaluate the proposed method, we experiment with three setups: the first represents distribution shift between tasks (LivDet-Iris-2017), the second represents distribution shift both between and within tasks (LivDet-Iris-2020), and the third represents disjoint distribution between tasks (Split MNIST). The experiments highlight the benefits of the proposed method. The source code is available at https://github.com/iPRoBe-lab/Continuous_Learning_FE_DM.
