Replay Consolidation with Label Propagation for Continual Object Detection
Riccardo De Monte, Davide Dalle Pezze, Marina Ceccon, Francesco Pasti, Francesco Paissan, Elisabetta Farella, Gian Antonio Susto, Nicola Bellotto
TL;DR
This work tackles continual learning for object detection, where missing annotations cause task interference in replay-based methods. The proposed Replay Consolidation with Label Propagation for Object Detection (RCLPOD) enriches replay memory with Label Propagation to attach pseudo-labels for old classes and to propagate knowledge into memory, while balancing memory via OCDM and reducing interference with a masking loss $L_{cls-mask}$ and stabilizing learning with $L_{feat-dist}$ across YOLOv8's backbone and neck features. The approach delivers state-of-the-art results on VOC and COCO CL benchmarks, demonstrating strong performance in long task sequences and favorable stability-plasticity trade-offs without increasing memory footprint. The solution is architecture-agnostic and suitable for real-world systems such as autonomous driving and robotics, offering a practical, memory-efficient path for continual object detection with modern detectors like YOLOv8.
Abstract
Continual Learning (CL) aims to learn new data while remembering previously acquired knowledge. In contrast to CL for image classification, CL for Object Detection faces additional challenges such as the missing annotations problem. In this scenario, images from previous tasks may contain instances of unknown classes that could reappear as labeled in future tasks, leading to task interference in replay-based approaches. Consequently, most approaches in the literature have focused on distillation-based techniques, which are effective when there is a significant class overlap between tasks. In our work, we propose an alternative to distillation-based approaches with a novel approach called Replay Consolidation with Label Propagation for Object Detection (RCLPOD). RCLPOD enhances the replay memory by improving the quality of the stored samples through a technique that promotes class balance while also improving the quality of the ground truth associated with these samples through a technique called label propagation. RCLPOD outperforms existing techniques on well-established benchmarks such as VOC and COC. Moreover, our approach is developed to work with modern architectures like YOLOv8, making it suitable for dynamic, real-world applications such as autonomous driving and robotics, where continuous learning and resource efficiency are essential.
