HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection
Zican Shi, Jing Hu, Jie Ren, Hengkang Ye, Xuyang Yuan, Yan Ouyang, Jia He, Bo Ji, Junyu Guo
TL;DR
This work addresses the persistent challenge of tiny object detection (TOD) by augmenting Feature Pyramid Networks with two novel modules: High Frequency Perception (HFP) and Spatial Dependency Perception (SDP). HFP enriches tiny-object features by applying a high-pass filter to extract high-frequency content and deriving channel-wise and spatial-wise attention from the resulting responses. SDP introduces pixel-level cross-attention between adjacent feature maps to capture spatial dependencies that standard FPNs miss. Ablation studies show that each module improves detection on its own and that the gains are largest when the two are combined, yielding substantial improvements over FPN across multiple detectors on AI-TOD and DOTA-mini10. The approach adds only a modest computational cost and can be easily embedded into existing FPN-based architectures, which makes it practical for aerial imagery and other TOD applications.
Abstract
The introduction of the Feature Pyramid Network (FPN) has significantly improved object detection performance. However, substantial challenges remain in detecting tiny objects, as their features occupy only a very small proportion of the feature maps. Although FPN integrates multi-scale features, it does not directly enhance or enrich the features of tiny objects. Furthermore, FPN lacks spatial perception ability. To address these issues, we propose a novel High Frequency and Spatial Perception Feature Pyramid Network (HS-FPN) with two innovative modules. First, we design a high frequency perception module (HFP) that generates high frequency responses through high pass filters. These high frequency responses are used as mask weights from both spatial and channel perspectives to enrich and highlight the features of tiny objects in the original feature maps. Second, we develop a spatial dependency perception module (SDP) to capture the spatial dependencies that FPN lacks. Our experiments demonstrate that detectors based on HS-FPN exhibit competitive advantages over state-of-the-art models on the AI-TOD dataset for tiny object detection.
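The HFP idea described above (high-pass filter, then channel-wise and spatial-wise mask weights) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The FFT-based filter, the cutoff parameter `ratio`, the sigmoid gating, the additive fusion, and the function names `high_pass` and `hfp_sketch` are all assumptions made for illustration, and any learned layers a real module would contain are omitted.

```python
import numpy as np


def high_pass(x, ratio=0.25):
    """Suppress the lowest spatial frequencies of each channel of x (C, H, W).

    Assumption: an FFT-based filter with a rectangular cutoff; the source
    only states that a high-pass filter is used.
    """
    _, h, w = x.shape
    # Move the zero-frequency component to the center of the spectrum.
    spec = np.fft.fftshift(np.fft.fft2(x, axes=(-2, -1)), axes=(-2, -1))
    # Zero a centered block of low frequencies (always includes the DC term).
    rh = max(1, int(h * ratio / 2))
    rw = max(1, int(w * ratio / 2))
    cy, cx = h // 2, w // 2
    mask = np.ones((h, w))
    mask[cy - rh:cy + rh + 1, cx - rw:cx + rw + 1] = 0.0
    filtered = np.fft.ifft2(
        np.fft.ifftshift(spec * mask, axes=(-2, -1)), axes=(-2, -1)
    )
    return filtered.real


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def hfp_sketch(x, ratio=0.25):
    """Reweight a feature map x (C, H, W) by its high-frequency responses."""
    hf = np.abs(high_pass(x, ratio))
    # Channel perspective: one weight per channel from the pooled response.
    channel_w = sigmoid(hf.mean(axis=(1, 2), keepdims=True))  # (C, 1, 1)
    # Spatial perspective: one weight per pixel, pooled over channels.
    spatial_w = sigmoid(hf.mean(axis=0, keepdims=True))       # (1, H, W)
    # Hypothetical fusion: sum of the two reweighted copies of x.
    return x * channel_w + x * spatial_w


rng = np.random.default_rng(0)
features = rng.standard_normal((4, 16, 16))
enhanced = hfp_sketch(features)
print(enhanced.shape)
```

In this sketch, locations and channels with stronger high-frequency content receive larger weights, which is the intuition behind using high-frequency responses to highlight tiny objects; a feature map with no high-frequency content passes through unchanged.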
