DCA: Dividing and Conquering Amnesia in Incremental Object Detection

Aoting Zhang; Dongbao Yang; Chang Liu; Xiaopeng Hong; Miao Shang; Yu Zhou

TL;DR

This work addresses catastrophic forgetting in incremental object detection by identifying a forgetting imbalance: localization is relatively stable and class-agnostic, while recognition degrades severely as new classes are added. It introduces Divide-and-Conquer Amnesia (DCA), a localization-then-recognition framework that decouples the detector into two branches, preserving localization while guiding recognition with semantic knowledge from pre-trained language models. Key components include a semantic-guided recognition decoder, duplex classifier fusion, and Hybrid Knowledge Distillation, which curbs feature drift without storing old exemplars. Experiments on VOC and COCO show state-of-the-art performance, especially in long-term incremental scenarios, while remaining exemplar-free and robust to the choice of language model. Overall, DCA provides a scalable, semantics-driven path to robust continual object detection.

Abstract

Incremental object detection (IOD) aims to train an object detector that can continuously localize and recognize novel classes while preserving its performance on previous classes. Existing methods have achieved some success by improving knowledge distillation and exemplar replay for transformer-based detection frameworks, but the intrinsic forgetting mechanisms remain underexplored. In this paper, we investigate the cause of forgetting and discover a forgetting imbalance between localization and recognition in transformer-based IOD: localization is less prone to forgetting and can generalize to future classes, whereas catastrophic forgetting occurs primarily in recognition. Based on these insights, we propose a Divide-and-Conquer Amnesia (DCA) strategy, which redesigns transformer-based IOD as a localization-then-recognition process. DCA maintains and transfers the localization ability, leaving the decoupled, fragile recognition to be addressed separately. To reduce feature drift in recognition, we leverage semantic knowledge encoded in pre-trained language models to anchor class representations within a unified feature space across incremental tasks. This involves designing a duplex classifier fusion and embedding class semantic features into the recognition decoding process in the form of queries. Extensive experiments validate that our approach achieves state-of-the-art performance, especially for long-term incremental scenarios. For example, under the four-step setting on MS-COCO, our DCA strategy significantly improves the final AP by 6.9%.
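
To make the idea of anchoring class representations in a shared semantic space more concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes that each class has a fixed embedding vector (here hand-written toy vectors standing in for the output of a pre-trained language model) and that recognition scores are cosine similarities between an object feature and those frozen anchors. The class `SemanticAnchorClassifier` and its methods are hypothetical names introduced only for this example; the duplex classifier fusion and query-based decoding described in the abstract are not modeled here.

```python
import math


def l2_normalize(vec):
    """Return vec scaled to unit L2 norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


class SemanticAnchorClassifier:
    """Hypothetical classifier whose per-class weights are frozen semantic anchors.

    Assumption: each anchor stands in for a language-model embedding of the
    class name. Anchors are never updated, so adding new classes cannot move
    the anchors of old classes.
    """

    def __init__(self):
        self.class_names = []
        self.anchors = []

    def add_classes(self, name_to_embedding):
        """Register the classes of a new incremental step."""
        for name, embedding in name_to_embedding.items():
            if name in self.class_names:
                raise ValueError(f"class {name!r} is already registered")
            self.class_names.append(name)
            self.anchors.append(l2_normalize(embedding))

    def logits(self, feature):
        """Cosine similarity between one object feature and every anchor."""
        unit = l2_normalize(feature)
        return [sum(a * f for a, f in zip(anchor, unit)) for anchor in self.anchors]

    def predict(self, feature):
        """Name of the class whose anchor is most similar to the feature."""
        scores = self.logits(feature)
        best = max(range(len(scores)), key=scores.__getitem__)
        return self.class_names[best]


# Step 1: two base classes with toy 3-d "semantic" embeddings.
clf = SemanticAnchorClassifier()
clf.add_classes({"cat": [1.0, 0.0, 0.0], "dog": [0.0, 1.0, 0.0]})
feature = [0.9, 0.1, 0.0]
old_logits = clf.logits(feature)

# Step 2: a new class arrives; the old anchors are left untouched.
clf.add_classes({"bus": [0.0, 0.0, 1.0]})
new_logits = clf.logits(feature)
```

In this toy setup, the scores for "cat" and "dog" are identical before and after "bus" is added, because the old anchors are frozen. In a real detector the object features themselves would still drift as the network is fine-tuned, which is the part that distillation is meant to address.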
