Table of Contents
Fetching ...

Hybrid Mask Generation for Infrared Small Target Detection with Single-Point Supervision

Weijie He, Mushui Liu, Yunlong Yu

TL;DR

This work tackles infrared small target detection with single-point supervision by introducing Hybrid Mask Generation (HMG), which combines a learning-free, size-adaptive Point-to-Mask Generation (PMG) with a learning-based updating framework. PMG converts point annotations into bounding boxes (Point-to-Box) and then into initial pseudo masks (Box-to-Mask) using region-based thresholds and directional probability maps, while Pseudo Mask Updating (FAF and MDR) merges neural predictions with these masks to produce high-quality, hybrid supervision. Across three SIRST datasets, HMG achieves IoU gains over prior single-point methods and approaches fully supervised performance in some configurations, while maintaining low false-alarm rates and robust performance under real-world perturbations. The method reduces annotation costs and provides a practical, scalable solution for IRSTD, with strong potential for deployment in real systems. The key contributions are the size-aware PMG, the FAF/MDR updating scheme, and extensive ablations validating robustness to hyperparameters and cropping choices.

Abstract

Single-frame infrared small target (SIRST) detection poses a significant challenge due to the requirement to discern minute targets amidst complex infrared background clutter. In this paper, we focus on a weakly-supervised paradigm to obtain high-quality pseudo masks from the point-level annotation by integrating a novel learning-free method with the hybrid of the learning-based method. The learning-free method adheres to a sequential process, progressing from a point annotation to the bounding box that encompasses the target, and subsequently to detailed pseudo masks, while the hybrid is achieved through filtering out false alarms and retrieving missed detections in the network's prediction to provide a reliable supplement for learning-free masks. The experimental results show that our learning-free method generates pseudo masks with an average Intersection over Union (IoU) that is 4.3% higher than the second-best learning-free competitor across three datasets, while the hybrid learning-based method further enhances the quality of pseudo masks, achieving an additional average IoU increase of 3.4%.

Hybrid Mask Generation for Infrared Small Target Detection with Single-Point Supervision

TL;DR

This work tackles infrared small target detection with single-point supervision by introducing Hybrid Mask Generation (HMG), which combines a learning-free, size-adaptive Point-to-Mask Generation (PMG) with a learning-based updating framework. PMG converts point annotations into bounding boxes (Point-to-Box) and then into initial pseudo masks (Box-to-Mask) using region-based thresholds and directional probability maps, while Pseudo Mask Updating (FAF and MDR) merges neural predictions with these masks to produce high-quality, hybrid supervision. Across three SIRST datasets, HMG achieves IoU gains over prior single-point methods and approaches fully supervised performance in some configurations, while maintaining low false-alarm rates and robust performance under real-world perturbations. The method reduces annotation costs and provides a practical, scalable solution for IRSTD, with strong potential for deployment in real systems. The key contributions are the size-aware PMG, the FAF/MDR updating scheme, and extensive ablations validating robustness to hyperparameters and cropping choices.

Abstract

Single-frame infrared small target (SIRST) detection poses a significant challenge due to the requirement to discern minute targets amidst complex infrared background clutter. In this paper, we focus on a weakly-supervised paradigm to obtain high-quality pseudo masks from the point-level annotation by integrating a novel learning-free method with the hybrid of the learning-based method. The learning-free method adheres to a sequential process, progressing from a point annotation to the bounding box that encompasses the target, and subsequently to detailed pseudo masks, while the hybrid is achieved through filtering out false alarms and retrieving missed detections in the network's prediction to provide a reliable supplement for learning-free masks. The experimental results show that our learning-free method generates pseudo masks with an average Intersection over Union (IoU) that is 4.3% higher than the second-best learning-free competitor across three datasets, while the hybrid learning-based method further enhances the quality of pseudo masks, achieving an additional average IoU increase of 3.4%.
Paper Structure (30 sections, 4 equations, 9 figures, 5 tables)

This paper contains 30 sections, 4 equations, 9 figures, 5 tables.

Figures (9)

  • Figure 1: Illustration samples from the IRSTD-1K ISNet and NUDT-SIRST DNANet datasets showcase different mask generation techniques. The learning-free methods are often constrained by cropping size, leading to missed detections in pixel-level outside the cropping area. The deep-learning model is not constrained by spatial limitations, while inevitably facing false alarms or missed detections in target-level. The hybrid method, combining the strengths of both learning-free and deep-learning models, obtains high-quality masks.
  • Figure 2: Illustration of the proposed pseudo-mask generation process. The PMG module is utilized to generate initial masks for the supervision of the IRSTD model. The Pseudo Mask Updating module, composed of FAF and MDR components, is used to combine the initial mask with the model's predictions for higher-quality pseudo masks.
  • Figure 3: The illustration of obtaining the left boundary of the bounding box from the point label.
  • Figure 4: Impacts of cropping size on average of three datasets for the quality of pseudo masks with different sizes of small targets for MCLC li2023monte and our PMG.
  • Figure 5: Impacts of $L_{dp}$ and $\alpha$ for the quality of initial pseudo masks on three datasets.
  • ...and 4 more figures