IrisNet: Infrared Image Status Awareness Meta Decoder for Infrared Small Targets Detection
Xuelin Qian, Jiaming Lu, Zixuan Wang, Wenxuan Wang, Zhongling Huang, Dingwen Zhang, Junwei Han
TL;DR
IRSTD suffers from pattern drift and low SNR across diverse environments. The authors introduce IrisNet, a meta-learned framework that dynamically generates the entire decoder conditioned on infrared image status through an image-to-decoder transformer and a structured 2D decoder representation. Key contributions include (1) dynamic image-to-decoder mapping, (2) a structured decoder that preserves inter-layer parameter relationships, and (3) high-frequency augmentation in the encoder to improve edge and target cues. Empirical results on NUAA-SIRST, NUDT-SIRST, and IRSTD-1K demonstrate state-of-the-art performance and improved robustness for infrared small-target detection.
Abstract
Infrared Small Target Detection (IRSTD) faces significant challenges due to low signal-to-noise ratios, complex backgrounds, and the absence of discernible target features. While deep learning-based encoder-decoder frameworks have advanced the field, their static pattern learning suffers from pattern drift across diverse scenarios (\emph{e.g.}, day/night variations, sky/maritime/ground domains), limiting robustness. To address this, we propose IrisNet, a novel meta-learned framework that dynamically adapts detection strategies to the input infrared image status. Our approach establishes a dynamic mapping between infrared image features and entire decoder parameters via an image-to-decoder transformer. More concretely, we represent the parameterized decoder as a structured 2D tensor preserving hierarchical layer correlations and enable the transformer to model inter-layer dependencies through self-attention while generating adaptive decoding patterns via cross-attention. To further enhance the perception ability of infrared images, we integrate high-frequency components to supplement target-position and scene-edge information. Experiments on NUDT-SIRST, NUAA-SIRST, and IRSTD-1K datasets demonstrate the superiority of our IrisNet, achieving state-of-the-art performance.
