Rethinking Open-Set Object Detection: Issues, a New Formulation, and Taxonomy
Yusuke Hosoya, Masanori Suganuma, Takayuki Okatani
TL;DR
This paper critiques the prevailing OSOD formulations and introduces OSOD-III, a practical open-set detection framework that is open at the class level but closed at the super-class level. By benchmarking OSOD-III on Open Images, CUB200, and MTSD, the authors show that existing OSOD-II methods poorly detect unknowns when evaluated with standard AP metrics, and that simple baselines can be competitive. They also reveal evaluation pitfalls in prior metrics (A-OSE, WI) and provide a taxonomy clarifying OSOD terminology. The work highlights a clear path for progress: define targets within a superclass, evaluate with AP on unknowns, and investigate detector behavior and NMS strategies to reduce confusion between known and unknown detections. Overall, OSOD-III offers a realistic, testable formulation that is likely to steer practical progress in open-set object detection.
Abstract
Open-set object detection (OSOD), a task involving the detection of unknown objects while accurately detecting known objects, has recently gained attention. However, we identify a fundamental issue with the problem formulation employed in current OSOD studies. Inherent to object detection is knowing "what to detect," which contradicts the idea of identifying "unknown" objects. This sets OSOD apart from open-set recognition (OSR). This contradiction complicates a proper evaluation of methods' performance, a fact that previous studies have overlooked. Next, we propose a novel formulation wherein detectors are required to detect both known and unknown classes within specified super-classes of object classes. This new formulation is free from the aforementioned issues and has practical applications. Finally, we design benchmark tests utilizing existing datasets and report the experimental evaluation of existing OSOD methods. The results show that existing methods fail to accurately detect unknown objects due to misclassification of known and unknown classes rather than incorrect bounding box prediction. As a byproduct, we introduce a taxonomy of OSOD, resolving confusion prevalent in the literature. We anticipate that our study will encourage the research community to reconsider OSOD and facilitate progress in the right direction.
