Table of Contents
Fetching ...

SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification

Mohammed Q. Alkhatib, M. Sami Zitouni, Mina Al-Saad, Nour Aburaed, Hussain Al-Ahmad

TL;DR

This work addresses PolSAR image classification, where polarization and phase information pose unique challenges for traditional learning approaches. It introduces SDF2Net, a three-branch CV-3D-CNN that fuses shallow, medium, and deep features, followed by an attention module and fully connected classifier to exploit complex-valued data. On AIRSAR/ESAR datasets Flevoland, San Francisco, and Oberpfaffenhofen, it achieves state-of-the-art OA, AA, and kappa, with notable gains under limited training data (e.g., OA up to $97.13\%$ on San Francisco). Median post-processing further boosts accuracy, underscoring the method’s robustness and practicality for PolSAR-based land cover mapping.

Abstract

Polarimetric synthetic aperture radar (PolSAR) images encompass valuable information that can facilitate extensive land cover interpretation and generate diverse output products. Extracting meaningful features from PolSAR data poses challenges distinct from those encountered in optical imagery. Deep learning (DL) methods offer effective solutions for overcoming these challenges in PolSAR feature extraction. Convolutional neural networks (CNNs) play a crucial role in capturing PolSAR image characteristics by leveraging kernel capabilities to consider local information and the complex-valued nature of PolSAR data. In this study, a novel three-branch fusion of complex-valued CNN, named the Shallow to Deep Feature Fusion Network (SDF2Net), is proposed for PolSAR image classification. To validate the performance of the proposed method, classification results are compared against multiple state-of-the-art approaches using the airborne synthetic aperture radar (AIRSAR) datasets of Flevoland and San Francisco, as well as the ESAR Oberpfaffenhofen dataset. The results indicate that the proposed approach demonstrates improvements in overallaccuracy, with a 1.3% and 0.8% enhancement for the AIRSAR datasets and a 0.5% improvement for the ESAR dataset. Analyses conducted on the Flevoland data underscore the effectiveness of the SDF2Net model, revealing a promising overall accuracy of 96.01% even with only a 1% sampling ratio.

SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification

TL;DR

This work addresses PolSAR image classification, where polarization and phase information pose unique challenges for traditional learning approaches. It introduces SDF2Net, a three-branch CV-3D-CNN that fuses shallow, medium, and deep features, followed by an attention module and fully connected classifier to exploit complex-valued data. On AIRSAR/ESAR datasets Flevoland, San Francisco, and Oberpfaffenhofen, it achieves state-of-the-art OA, AA, and kappa, with notable gains under limited training data (e.g., OA up to on San Francisco). Median post-processing further boosts accuracy, underscoring the method’s robustness and practicality for PolSAR-based land cover mapping.

Abstract

Polarimetric synthetic aperture radar (PolSAR) images encompass valuable information that can facilitate extensive land cover interpretation and generate diverse output products. Extracting meaningful features from PolSAR data poses challenges distinct from those encountered in optical imagery. Deep learning (DL) methods offer effective solutions for overcoming these challenges in PolSAR feature extraction. Convolutional neural networks (CNNs) play a crucial role in capturing PolSAR image characteristics by leveraging kernel capabilities to consider local information and the complex-valued nature of PolSAR data. In this study, a novel three-branch fusion of complex-valued CNN, named the Shallow to Deep Feature Fusion Network (SDF2Net), is proposed for PolSAR image classification. To validate the performance of the proposed method, classification results are compared against multiple state-of-the-art approaches using the airborne synthetic aperture radar (AIRSAR) datasets of Flevoland and San Francisco, as well as the ESAR Oberpfaffenhofen dataset. The results indicate that the proposed approach demonstrates improvements in overallaccuracy, with a 1.3% and 0.8% enhancement for the AIRSAR datasets and a 0.5% improvement for the ESAR dataset. Analyses conducted on the Flevoland data underscore the effectiveness of the SDF2Net model, revealing a promising overall accuracy of 96.01% even with only a 1% sampling ratio.
Paper Structure (20 sections, 10 equations, 12 figures, 12 tables)

This paper contains 20 sections, 10 equations, 12 figures, 12 tables.

Figures (12)

  • Figure 1: Illustration of different types of convolution on images with multiple channels. (a) 2D Convolution; (b) 3D Convolution; (c) Complex Valued 3D Convolution.
  • Figure 2: Squeeze and Excitation Block.
  • Figure 3: Block diagram of the proposed SDF2Net.
  • Figure 4: Flevoland PolSAR data (left) Pauli RGB composite (right) Reference class map.
  • Figure 5: San Francisco PolSAR data (left) Pauli RGB composite (right) Reference class map.
  • ...and 7 more figures