MTSGL: Multi-Task Structure Guided Learning for Robust and Interpretable SAR Aircraft Recognition
Qishan He, Lingjun Zhao, Ru Luo, Siqian Zhang, Lin Lei, Kefeng Ji, Gangyao Kuang
TL;DR
This work tackles robust and interpretable aircraft recognition in SAR imagery by introducing a structure-based annotation method and a multi-task structure guided learning (MTSGL) framework. MTSGL integrates two structure-focused tasks—Structural Semantic Awareness (SSA) and Structural Consistency Regularization (SCR)—with a Pareto-oriented gradient descent to balance learning across tasks, enabling the encoder to internalize aircraft structural knowledge. On the MT-SARD dataset, MTSGL achieves state-of-the-art performance, with a notable OA improvement from 83.37% (baseline) to 90.21% and a Kappa rise to approximately 0.886 when using Pareto optimization, while also providing interpretable structural evidence via segmentation and attention visualizations. The approach demonstrates enhanced robustness, particularly under limited training data and annotation imperfections, and offers a practical path toward more interpretable SAR ATR systems, with future work oriented toward weakly supervised annotation to further reduce labeling costs.
Abstract
Aircraft recognition in synthetic aperture radar (SAR) imagery is a fundamental mission in both military and civilian applications. Recently deep learning (DL) has emerged a dominant paradigm for its explosive performance on extracting discriminative features. However, current classification algorithms focus primarily on learning decision hyperplane without enough comprehension on aircraft structural knowledge. Inspired by the fined aircraft annotation methods for optical remote sensing images (RSI), we first introduce a structure-based SAR aircraft annotations approach to provide structural and compositional supplement information. On this basis, we propose a multi-task structure guided learning (MTSGL) network for robust and interpretable SAR aircraft recognition. Besides the classification task, MTSGL includes a structural semantic awareness (SSA) module and a structural consistency regularization (SCR) module. The SSA is designed to capture structure semantic information, which is conducive to gain human-like comprehension of aircraft knowledge. The SCR helps maintain the geometric consistency between the aircraft structure in SAR imagery and the proposed annotation. In this process, the structural attribute can be disentangled in a geometrically meaningful manner. In conclusion, the MTSGL is presented with the expert-level aircraft prior knowledge and structure guided learning paradigm, aiming to comprehend the aircraft concept in a way analogous to the human cognitive process. Extensive experiments are conducted on a self-constructed multi-task SAR aircraft recognition dataset (MT-SARD) and the effective results illustrate the superiority of robustness and interpretation ability of the proposed MTSGL.
