Similarity Distance-Based Label Assignment for Tiny Object Detection
Shuohao Shi, Qiang Fang, Tong Zhao, Xin Xu
TL;DR
This work tackles tiny object detection by rethinking label assignment with a Similarity Distance (SimD) that needs no hand-set hyperparameters and jointly captures location and shape similarity between bounding boxes. SimD normalizes adaptively using parameters derived from the dataset and replaces IoU in both label assignment (MaxSimDAssigner) and NMS, yielding higher-quality positive samples without inflating false positives. Experiments on AI-TOD, AI-TODv2, VisDrone2019, and SODA-D show state-of-the-art results, particularly for very tiny objects: on AI-TOD the method exceeds the best competing methods by 1.8 AP points overall and by 4.1 AP points on very tiny objects. The approach applies to common anchor-based detectors, and the code is publicly available.
Abstract
Tiny object detection is becoming one of the most challenging tasks in computer vision because of the limited object size and lack of information. The label assignment strategy is a key factor affecting the accuracy of object detection. Although there are some effective label assignment strategies for tiny objects, most of them focus on reducing the sensitivity to the bounding boxes to increase the number of positive samples, and they have fixed hyperparameters that need to be set. However, more positive samples do not necessarily lead to better detection results; in fact, excessive positive samples may lead to more false positives. In this paper, we introduce a simple but effective strategy named the Similarity Distance (SimD) to evaluate the similarity between bounding boxes. The proposed strategy not only considers both location and shape similarity but also learns hyperparameters adaptively, ensuring that it can adapt to different datasets and to various object sizes within a dataset. Our approach can be simply applied in common anchor-based detectors in place of the IoU for label assignment and Non-Maximum Suppression (NMS). Extensive experiments on four mainstream tiny object detection datasets demonstrate the superior performance of our method; in particular, on AI-TOD it exceeds the state-of-the-art competitors by 1.8 AP points overall and by 4.1 AP points on very tiny objects. Code is available at: https://github.com/cszzshi/SimD
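To illustrate why a similarity that combines location and shape can be more informative than IoU for tiny boxes, the following is a minimal sketch. It is not the SimD definition from the paper or the repository. The function `box_similarity`, its exponential form, and the normalization constants `m` and `n` are assumptions made for illustration only; in the paper the normalization is derived from the dataset, and the exact formula should be taken from the linked code.

```python
import math

def iou(a, b):
    """Standard IoU for two boxes given as (cx, cy, w, h)."""
    ax1, ay1 = a[0] - a[2] / 2, a[1] - a[3] / 2
    ax2, ay2 = a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1 = b[0] - b[2] / 2, b[1] - b[3] / 2
    bx2, by2 = b[0] + b[2] / 2, b[1] + b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0

def box_similarity(a, b, m=1.0, n=1.0):
    """Hypothetical location-plus-shape similarity in (0, 1].

    Center offsets and size differences are normalized by the summed
    widths/heights of the two boxes, scaled by placeholder constants
    m and n (assumed here; not the paper's dataset-derived values).
    """
    scale_w = (a[2] + b[2]) / m
    scale_h = (a[3] + b[3]) / n
    location = math.hypot((a[0] - b[0]) / scale_w, (a[1] - b[1]) / scale_h)
    shape = math.hypot((a[2] - b[2]) / scale_w, (a[3] - b[3]) / scale_h)
    return math.exp(-(location + shape))

# A 4x4 ground-truth box and two same-sized anchors that do not overlap it.
gt = (10.0, 10.0, 4.0, 4.0)
near_anchor = (15.0, 10.0, 4.0, 4.0)
far_anchor = (30.0, 10.0, 4.0, 4.0)

print(iou(gt, near_anchor), iou(gt, far_anchor))
print(box_similarity(gt, near_anchor), box_similarity(gt, far_anchor))
```

In this sketch both anchors have an IoU of zero with the ground truth, so IoU cannot rank them, while the distance-based score still ranks the nearer anchor higher. That is the kind of signal a similarity-based assigner can use when choosing positive samples for tiny objects.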
