IncSAR: A Dual Fusion Incremental Learning Framework for SAR Target Recognition
George Karantaidis, Athanasios Pantsios, Ioannis Kompatsiaris, Symeon Papadopoulos
TL;DR
This work tackles catastrophic forgetting in SAR-ATR under class-incremental learning by introducing IncSAR, a dual-branch framework that fuses a pre-trained Vision Transformer with a custom SAR-CNN. The pipeline incorporates RPCA-based denoising, a fixed random projection layer to boost feature separability, and a decorrelated prototype classifier within a late-fusion architecture; lightweight variants (IncSAR_Lite) and an attention-enhanced fusion (IncSAR_LAtt) are also explored. Across MSTAR, SAR-AIRcraft-1.0, and OpenSARShip, IncSAR achieves state-of-the-art incremental performance with minimal forgetting, demonstrating strong cross-domain generalization and robustness under data-limited conditions. The approach offers a scalable, exemplar-free solution for real-world SAR-ATR deployment in dynamic environments, with practical implications for defense and remote sensing deployments.
Abstract
Deep learning techniques have achieved significant success in Synthetic Aperture Radar (SAR) target recognition using predefined datasets in static scenarios. However, real-world applications demand that models incrementally learn new information without forgetting previously acquired knowledge. The challenge of catastrophic forgetting, where models lose past knowledge when adapting to new tasks, remains a critical issue. In this paper, we introduce IncSAR, an incremental learning framework designed to tackle catastrophic forgetting in SAR target recognition. IncSAR combines the power of a Vision Transformer (ViT) and a custom-designed Convolutional Neural Network (CNN) in a dual-branch architecture, integrated via a late-fusion strategy. Additionally, we explore the use of TinyViT to reduce computational complexity and propose an attention mechanism to dynamically enhance feature representation. To mitigate the speckle noise inherent in SAR images, we employ a denoising module based on a neural network approximation of Robust Principal Component Analysis (RPCA), leveraging a simple neural network for efficient noise reduction in SAR imagery. Moreover, a random projection layer improves the linear separability of features, and a variant of Linear Discriminant Analysis (LDA) decorrelates extracted class prototypes for better generalization. Extensive experiments on the MSTAR, SAR-AIRcraft-1.0, and OpenSARShip benchmark datasets demonstrate that IncSAR significantly outperforms state-of-the-art approaches, achieving a 99.63\% average accuracy and a 0.33\% performance drop, representing an 89\% improvement in retention compared to existing techniques. The source code is available at https://github.com/geokarant/IncSAR.
