RRCANet: Recurrent Reusable-Convolution Attention Network for Infrared Small Target Detection
Yongxian Liu, Boyang Li, Ting Liu, Zaiping Lin, Wei An
TL;DR
This work tackles infrared small target detection by introducing RRCA-Net, a lightweight network that achieves high precision with minimal parameters. It leverages a recurrent reusable-convolution block (RuCB) to iteratively refine features without adding new kernels, and a dual interactive attention aggregation module (DIAAM) to fuse multi-scale information efficiently. The loss function, DpT-k, combines Dice, Poly, and Top-k terms to promote shape accuracy and focus on hard samples, improving convergence and reducing false alarms. Evaluations on NUAA-SIRST, IRSTD-1k, and DenseSIRST demonstrate competitive accuracy and robustness, with the added benefit of transferability as a plug-and-play component for other IRSTD methods.
Abstract
Infrared small target detection is a challenging task due to its unique characteristics (e.g., small, dim, shapeless and changeable). Recently published CNN-based methods have achieved promising performance with heavy feature extraction and fusion modules. To achieve efficient and effective detection, we propose a recurrent reusable-convolution attention network (RRCA-Net) for infrared small target detection. Specifically, RRCA-Net incorporates reusable-convolution block (RuCB) in a recurrent manner without introducing extra parameters. With the help of the repetitive iteration in RuCB, the high-level information of small targets in the deep layers can be well maintained and further refined. Then, a dual interactive attention aggregation module (DIAAM) is proposed to promote the mutual enhancement and fusion of refined information. In this way, RRCA-Net can both achieve high-level feature refinement and enhance the correlation of contextual information between adjacent layers. Moreover, to achieve steady convergence, we design a target characteristic inspired loss function (DpT-k loss) by integrating physical and mathematical constraints. Experimental results on three benchmark datasets (e.g. NUAA-SIRST, IRSTD-1k, DenseSIRST) demonstrate that our RRCA-Net can achieve comparable performance to the state-of-the-art methods while maintaining a small number of parameters, and act as a plug and play module to introduce consistent performance improvement for several popular IRSTD methods.
