CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation
Vishal Thengane, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Lu Yin, Xiatian Zhu, Salman Khan
TL;DR
The paper tackles the problem of forgetting and imbalance in class-incremental 3D instance segmentation by proposing CLIMB-3D, a modular framework that integrates Exemplar Replay (ER), a Pseudo-Label Generator (PLG), and Class-Balanced Re-weighting (CBR). It formulates CI-3DIS with disjoint class introductions across tasks and uses top-$K$ pseudo-labels from a frozen model plus class-frequency-based re-weighting to maintain past knowledge while learning new classes, all under memory constraints. Three benchmarking scenarios on ScanNet200 are introduced to reflect frequency, semantic similarity, and random grouping of categories, and the method achieves state-of-the-art gains, including up to 16.76% mAP improvements in 3D instance segmentation and approximately 30% mIoU gains in 3D semantic segmentation. The findings demonstrate robust learning across both frequent and rare classes and highlight the practical impact for real-world continual 3D scene understanding, with code available at the authors' GitHub repository.
Abstract
While 3D instance segmentation (3DIS) has advanced significantly, most existing methods assume that all object classes are known in advance and uniformly distributed. However, this assumption is unrealistic in dynamic, real-world environments where new classes emerge gradually and exhibit natural imbalance. Although some approaches address the emergence of new classes, they often overlook class imbalance, which leads to suboptimal performance, particularly on rare categories. To tackle this, we propose \ourmethodbf, a unified framework for \textbf{CL}ass-incremental \textbf{Imb}alance-aware \textbf{3D}IS. Building upon established exemplar replay (ER) strategies, we show that ER alone is insufficient to achieve robust performance under memory constraints. To mitigate this, we introduce a novel pseudo-label generator (PLG) that extends supervision to previously learned categories by leveraging predictions from a frozen model trained on prior tasks. Despite its promise, PLG tends to be biased towards frequent classes. Therefore, we propose a class-balanced re-weighting (CBR) scheme that estimates object frequencies from pseudo-labels and dynamically adjusts training bias, without requiring access to past data. We design and evaluate three incremental scenarios for 3DIS on the challenging ScanNet200 dataset and additionally validate our method for semantic segmentation on ScanNetV2. Our approach achieves state-of-the-art results, surpassing prior work by up to 16.76\% mAP for instance segmentation and approximately 30\% mIoU for semantic segmentation, demonstrating strong generalisation across both frequent and rare classes. Code is available at: https://github.com/vgthengane/CLIMB3D
