Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection
Mengxuan Xiao, Yinfei Zhu, Yiming Zhu, Boyang Li, Feifei Zhang, Huan Wang, Meng Cai, Yimian Dai
TL;DR
This work tackles the challenge of detecting densely clustered infrared small targets by leveraging background semantics. It introduces DenseSIRST, a dataset with per-pixel background annotations, and BAFE-Net, a multi-task architecture that jointly performs target detection and background semantic segmentation via a dynamic cross-task feature exchange mechanism. The method employs BAG-CP to synthesize realistic densely clustered scenes and demonstrates improved detection accuracy with reduced false alarms across DenseSIRST and existing IRSTD benchmarks. Collectively, the approach highlights the importance of contextual information and explicit background modeling for robust infrared small target detection in complex environments.
Abstract
Infrared small target detection presents significant challenges due to the limited intrinsic features of the target and the overwhelming presence of visually similar background distractors. We contend that background semantics are critical for distinguishing between objects that appear visually similar in this context. To address this challenge, we propose a task, clustered infrared small target detection, and introduce DenseSIRST, a benchmark dataset that provides per-pixel semantic annotations for background regions. This dataset facilitates the shift from sparse to dense target detection. This dataset facilitates the shift from sparse to dense target detection. Building on this resource, we propose the Background-Aware Feature Exchange Network (BAFE-Net), a multi-task architecture that jointly tackles target detection and background semantic segmentation. BAFE-Net incorporates a dynamic cross-task feature hard-exchange mechanism, enabling the effective exchange of target and background semantics between the two tasks. Comprehensive experiments demonstrate that BAFE-Net significantly enhances target detection accuracy while mitigating false alarms. The DenseSIRST dataset, along with the code and trained models, is publicly available at https://github.com/GrokCV/BAFE-Net.
