Enhancing Evaluation Methods for Infrared Small-Target Detection in Real-world Scenarios
Saed Moradi, Alireza Memarmoghadam, Payman Moallem, Mohamad Farzan Sabahi
TL;DR
This work targets the gap between conventional evaluation metrics and real-world performance in infrared small target detection (IRSTD) by analyzing the pivotal role of thresholding. It argues that global thresholding, coupled with metrics tailored to practical detection, yields more meaningful assessments than local thresholding or traditional ROC-based post-thresholding measures. The authors introduce a unified post-thresholding evaluation workflow centered on a maximum threshold parameter $k_{\max}$ and a corresponding $P_{fa,\min}$, alongside adapting pre-thresholding metrics like $SCR$ and $BSF$ for alignment with real systems. They validate the approach on five widely used IRSTD algorithms, finding that $ADMD$ and $NIPPS$ most consistently enhance target regions while controlling false alarms, with the new metrics aligning with qualitative observations. The proposed framework offers a practical, threshold-aware benchmark for IRSTD research and deployment.
Abstract
Infrared small target detection (IRSTD) poses a significant challenge in the field of computer vision. While substantial efforts have been made over the past two decades to improve the detection capabilities of IRSTD algorithms, there has been a lack of extensive investigation into the evaluation metrics used for assessing their performance. In this paper, we employ a systematic approach to address this issue by first evaluating the effectiveness of existing metrics and then proposing new metrics to overcome the limitations of conventional ones. To achieve this, we carefully analyze the necessary conditions for successful detection and identify the shortcomings of current evaluation metrics, including both pre-thresholding and post-thresholding metrics. We then introduce new metrics that are designed to align with the requirements of real-world systems. Furthermore, we utilize these newly proposed metrics to compare and evaluate the performance of five widely recognized small infrared target detection algorithms. The results demonstrate that the new metrics provide consistent and meaningful quantitative assessments, aligning with qualitative observations.
