Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection
Xinhao Zhong, Siyu Jiao, Yao Zhao, Yunchao Wei
TL;DR
OSSOD presents a practical challenge where unlabeled data contain both ID and OOD objects, risking misclassification. The paper introduces CFL-Detector, a collaborative framework that jointly optimizes a feature-space contrastive loss $\mathcal{L}_{fc}$ and a logits-space uncertainty loss $\mathcal{L}_{uc}$ within a two-stage teacher–student training scheme, leveraging a memory pool of embeddings and uncertainty weighting to separate ID from OOD. This approach yields a robust detection system that marks OOD as 'unknown' and preserves ID accuracy, demonstrated across COCO Open-CLS, COCO Open-SUP, and VOC-COCO benchmarks with state-of-the-art results. While effective, the method may incur a slight decrease in ID performance due to the emphasis on OOD separation, indicating potential trade-offs to balance in future work.
Abstract
Current Semi-Supervised Object Detection (SSOD) methods enhance detector performance by leveraging large amounts of unlabeled data, assuming that both labeled and unlabeled data share the same label space. However, in open-set scenarios, the unlabeled dataset contains both in-distribution (ID) classes and out-of-distribution (OOD) classes. Applying semi-supervised detectors in such settings can lead to misclassifying OOD class as ID classes. To alleviate this issue, we propose a simple yet effective method, termed Collaborative Feature-Logits Detector (CFL-Detector). Specifically, we introduce a feature-level clustering method using contrastive loss to clarify vector boundaries in the feature space and highlight class differences. Additionally, by optimizing the logits-level uncertainty classification loss, the model enhances its ability to effectively distinguish between ID and OOD classes. Extensive experiments demonstrate that our method achieves state-of-the-art performance compared to existing methods.
