Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Daehee Lee, Minjong Yoo, Woo Kyung Kim, Wonje Choi, Honguk Woo
TL;DR
This work tackles continual imitation learning (CiL) by addressing the limited knowledge sharing of existing adapter-based methods. It introduces IsCiL, which stores and incrementally learns shareable skills as prototypes and per-skill adapters, enabling retrieval-based composition of actions for new tasks and non-stationary environments. The approach uses a fixed state encoder, a prototype-based skill retriever, and a LoRA-conditioned skill decoder to maintain stability while allowing scalable adaptation; it also supports task unlearning by removing specific skill prototypes and adapters. Empirical results on Franka-Kitchen and Meta-World demonstrate improved sample efficiency and robust task adaptation across Complete, Semi-complete, and Incomplete streams, with effective handling of unseen tasks and privacy-preserving unlearning. Overall, IsCiL narrows the gap between adapter-based CiL and cross-task knowledge sharing, delivering practical benefits for continual task learning in complex, long-horizon domains.
Abstract
Continual Imitation Learning (CiL) involves extracting and accumulating task knowledge from demonstrations across multiple stages and tasks to achieve a multi-task policy. With recent advancements in foundation models, there has been a growing interest in adapter-based CiL approaches, where adapters are established parameter-efficiently for tasks newly demonstrated. While these approaches isolate parameters for specific tasks and tend to mitigate catastrophic forgetting, they limit knowledge sharing among different demonstrations. We introduce IsCiL, an adapter-based CiL framework that addresses this limitation of knowledge sharing by incrementally learning shareable skills from different demonstrations, thus enabling sample-efficient task adaptation using the skills particularly in non-stationary CiL environments. In IsCiL, demonstrations are mapped into the state embedding space, where proper skills can be retrieved upon input states through prototype-based memory. These retrievable skills are incrementally learned on their corresponding adapters. Our CiL experiments with complex tasks in Franka-Kitchen and Meta-World demonstrate robust performance of IsCiL in both task adaptation and sample-efficiency. We also show a simple extension of IsCiL for task unlearning scenarios.
