Beyond Background Shift: Rethinking Instance Replay in Continual Semantic Segmentation
Hongmei Yin, Tingliang Feng, Fan Lyu, Fanhua Shang, Hongying Liu, Wei Feng, Liang Wan
TL;DR
This work tackles continual semantic segmentation (CSS) by addressing background shift caused by partial labeling. It introduces Enhanced Instance Replay (EIR), a full pipeline that stores old-class instances, selectively combines them with new images based on contextual predictions, places them strategically in the image background, and trains with a Region-Specific Knowledge Distillation (RSKD) loss. The approach yields substantial improvements over state-of-the-art CSS methods on Pascal VOC 2012 and ADE20K, demonstrating stronger balance between retaining old knowledge and learning new classes. By mitigating background confusion in both stored and new images, EIR advances practical CSS capabilities and offers a scalable framework for future improvements in instance-aware continual learning.
Abstract
In this work, we focus on continual semantic segmentation (CSS), where segmentation networks are required to continuously learn new classes without erasing knowledge of previously learned ones. Although storing images of old classes and directly incorporating them into the training of new models has proven effective in mitigating catastrophic forgetting in classification tasks, this strategy presents notable limitations in CSS. Specifically, the stored and new images with partial category annotations leads to confusion between unannotated categories and the background, complicating model fitting. To tackle this issue, this paper proposes a novel Enhanced Instance Replay (EIR) method, which not only preserves knowledge of old classes while simultaneously eliminating background confusion by instance storage of old classes, but also mitigates background shifts in the new images by integrating stored instances with new images. By effectively resolving background shifts in both stored and new images, EIR alleviates catastrophic forgetting in the CSS task, thereby enhancing the model's capacity for CSS. Experimental results validate the efficacy of our approach, which significantly outperforms state-of-the-art CSS methods.
