Class-balanced Open-set Semi-supervised Object Detection for Medical Images
Zhanyun Lu, Renshu Gu, Huimin Cheng, Siyu Pang, Mingyu Xu, Peifang Xu, Yaqi Wang, Yuichiro Kinoshita, Juan Ye, Gangyong Jia, Qing Wu
TL;DR
The paper tackles open-set semi-supervised object detection (OSSOD) in medical imaging, where unlabeled data may contain unknown disease categories and class imbalance is severe. It introduces two modules, Out-of-distribution Detection Fusion Classifier (OODFC) and Category Control Embed (CCE), integrated into a Mean-Teacher SSOD framework to mitigate unknown-category interference and balance foreground categories. OODFC merges unknown-class pseudo-labels with known ones using a dynamic threshold based on category AP, while CCE constructs a Foreground Information Library and synthesizes balanced foreground data via $x_i^{Syn}=\beta b_j^{'}+(1-\beta)x_i$, $y_i^{Syn}=y_j$, with $\beta=0.5$, all within an EMA-based training regime. Experiments on private ophthalmology and public Parasite datasets show consistent mAP gains, including a 4.25-point improvement on Parasite, demonstrating effective open-set handling and class balance with practical medical-imaging impact.
Abstract
Medical image datasets in the real world are often unlabeled and imbalanced, and Semi-Supervised Object Detection (SSOD) can utilize unlabeled data to improve an object detector. However, existing approaches predominantly assumed that the unlabeled data and test data do not contain out-of-distribution (OOD) classes. The few open-set semi-supervised object detection methods have two weaknesses: first, the class imbalance is not considered; second, the OOD instances are distinguished and simply discarded during pseudo-labeling. In this paper, we consider the open-set semi-supervised object detection problem which leverages unlabeled data that contain OOD classes to improve object detection for medical images. Our study incorporates two key innovations: Category Control Embed (CCE) and out-of-distribution Detection Fusion Classifier (OODFC). CCE is designed to tackle dataset imbalance by constructing a Foreground information Library, while OODFC tackles open-set challenges by integrating the ``unknown'' information into basic pseudo-labels. Our method outperforms the state-of-the-art SSOD performance, achieving a 4.25 mAP improvement on the public Parasite dataset.
