Debiased Novel Category Discovering and Localization
Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding
TL;DR
The paper tackles novel category discovery and localization in open-world object detection by addressing detector bias toward seen objects. It introduces Debiased Region Mining (DRM) with a dual RPN that jointly optimizes class-aware localization for knowns and class-agnostic localization for unknowns, coupled with semi-supervised instance-level contrastive finetuning and minibatch K-means clustering to discover and cluster new categories. The method achieves state-of-the-art performance on NCDL benchmarks by improving both localization accuracy and the quality of discovered unknown categories, while maintaining computational efficiency. This approach advances practical open-world detection by enabling simultaneous localization and clustering of novel objects without requiring exhaustive re-labeling.
Abstract
In recent years, object detection in deep learning has experienced rapid development. However, most existing object detection models perform well only on closed-set datasets, ignoring a large number of potential objects whose categories are not defined in the training set. These objects are often identified as background or incorrectly classified as pre-defined categories by the detectors. In this paper, we focus on the challenging problem of Novel Class Discovery and Localization (NCDL), aiming to train detectors that can detect the categories present in the training data, while also actively discover, localize, and cluster new categories. We analyze existing NCDL methods and identify the core issue: object detectors tend to be biased towards seen objects, and this leads to the neglect of unseen targets. To address this issue, we first propose an Debiased Region Mining (DRM) approach that combines class-agnostic Region Proposal Network (RPN) and class-aware RPN in a complementary manner. Additionally, we suggest to improve the representation network through semi-supervised contrastive learning by leveraging unlabeled data. Finally, we adopt a simple and efficient mini-batch K-means clustering method for novel class discovery. We conduct extensive experiments on the NCDL benchmark, and the results demonstrate that the proposed DRM approach significantly outperforms previous methods, establishing a new state-of-the-art.
