Anno-incomplete Multi-dataset Detection
Yiran Xu, Haoxiang Zhong, Kai Wu, Jialin Li, Yong Liu, Chengjie Wang, Shu-Tao Xia, Hongen Liao
TL;DR
This work tackles the practical problem of detecting all object categories across multiple datasets with incomplete annotations and heterogeneous features. It introduces a branch-interactive detector built on FCOS, enhanced by an Attention-based Feature Interactor (AFI) and a Knowledge Amalgamation (KA) training strategy that leverages teacher models, feature alignment, distillation, and pseudo-label supervision. Empirical results on COCO and VOC show consistent improvements over strong baselines, demonstrating the method's ability to fuse information across datasets with limited cross-dataset labels and imbalanced data. The approach offers a scalable path toward unified detection across diverse data sources, with potential to extend to additional datasets and more complex multi-domain settings without requiring exhaustive re-annotation.
Abstract
Object detectors have shown outstanding performance on various public datasets. However, annotating a new dataset for a new task is usually unavoidable in real, since 1) a single existing dataset usually does not contain all object categories needed; 2) using multiple datasets usually suffers from annotation incompletion and heterogeneous features. We propose a novel problem as "Annotation-incomplete Multi-dataset Detection", and develop an end-to-end multi-task learning architecture which can accurately detect all the object categories with multiple partially annotated datasets. Specifically, we propose an attention feature extractor which helps to mine the relations among different datasets. Besides, a knowledge amalgamation training strategy is incorporated to accommodate heterogeneous features from different sources. Extensive experiments on different object detection datasets demonstrate the effectiveness of our methods and an improvement of 2.17%, 2.10% in mAP can be achieved on COCO and VOC respectively.
