From Easy to Hard: Progressive Active Learning Framework for Infrared Small Target Detection with Single Point Supervision
Chuang Yu, Jinmiao Zhao, Yunpeng Liu, Sicheng Zhao, Yimian Dai, Xiangyu Yue
TL;DR
The paper tackles the challenge of detecting infrared small targets with single point supervision by introducing a Progressive Active Learning (PAL) framework that guides SIRST networks from easy to hard samples. It combines a model pre-start phase (EPG) to bootstrap basic capabilities, a fine dual-update strategy (COU and FIU) to progressively refine pseudo-labels and incorporate harder samples, and a decay mechanism to balance label expansion and contraction. Through extensive experiments on multiple public datasets, PAL achieves state-of-the-art gains over LESPS and closely approaches full supervision performance, while offering improved stability under limited data. This work provides a practical bridge between full and weak supervision for SIRST detection and introduces novel mechanisms (EPG, COU, FIU, EEDM) that can be leveraged in weakly supervised segmentation settings.
Abstract
Recently, single-frame infrared small target (SIRST) detection with single point supervision has drawn wide-spread attention. However, the latest label evolution with single point supervision (LESPS) framework suffers from instability, excessive label evolution, and difficulty in exerting embedded network performance. Inspired by organisms gradually adapting to their environment and continuously accumulating knowledge, we construct an innovative Progressive Active Learning (PAL) framework, which drives the existing SIRST detection networks progressively and actively recognizes and learns harder samples. Specifically, to avoid the early low-performance model leading to the wrong selection of hard samples, we propose a model pre-start concept, which focuses on automatically selecting a portion of easy samples and helping the model have basic task-specific learning capabilities. Meanwhile, we propose a refined dual-update strategy, which can promote reasonable learning of harder samples and continuous refinement of pseudo-labels. In addition, to alleviate the risk of excessive label evolution, a decay factor is reasonably introduced, which helps to achieve a dynamic balance between the expansion and contraction of target annotations. Extensive experiments show that existing SIRST detection networks equipped with our PAL framework have achieved state-of-the-art (SOTA) results on multiple public datasets. Furthermore, our PAL framework can build an efficient and stable bridge between full supervision and single point supervision tasks. Our code is available at https://github.com/YuChuang1205/PAL
