Table of Contents
Fetching ...

Dual Contrastive Network for Few-Shot Remote Sensing Image Scene Classification

Zhong Ji, Liyuan Hou, Xuan Wang, Gang Wang, Yanwei Pang

Abstract

Few-shot remote sensing image scene classification (FS-RSISC) aims at classifying remote sensing images with only a few labeled samples. The main challenges lie in small inter-class variances and large intra-class variances, which are the inherent property of remote sensing images. To address these challenges, we propose a transfer-based Dual Contrastive Network (DCN), which incorporates two auxiliary supervised contrastive learning branches during the training process. Specifically, one is a Context-guided Contrastive Learning (CCL) branch and the other is a Detail-guided Contrastive Learning (DCL) branch, which focus on inter-class discriminability and intra-class invariance, respectively. In the CCL branch, we first devise a Condenser Network to capture context features, and then leverage a supervised contrastive learning on top of the obtained context features to facilitate the model to learn more discriminative features. In the DCL branch, a Smelter Network is designed to highlight the significant local detail information. And then we construct a supervised contrastive learning based on the detail feature maps to fully exploit the spatial information in each map, enabling the model to concentrate on invariant detail features. Extensive experiments on four public benchmark remote sensing datasets demonstrate the competitive performance of our proposed DCN.

Dual Contrastive Network for Few-Shot Remote Sensing Image Scene Classification

Abstract

Few-shot remote sensing image scene classification (FS-RSISC) aims at classifying remote sensing images with only a few labeled samples. The main challenges lie in small inter-class variances and large intra-class variances, which are the inherent property of remote sensing images. To address these challenges, we propose a transfer-based Dual Contrastive Network (DCN), which incorporates two auxiliary supervised contrastive learning branches during the training process. Specifically, one is a Context-guided Contrastive Learning (CCL) branch and the other is a Detail-guided Contrastive Learning (DCL) branch, which focus on inter-class discriminability and intra-class invariance, respectively. In the CCL branch, we first devise a Condenser Network to capture context features, and then leverage a supervised contrastive learning on top of the obtained context features to facilitate the model to learn more discriminative features. In the DCL branch, a Smelter Network is designed to highlight the significant local detail information. And then we construct a supervised contrastive learning based on the detail feature maps to fully exploit the spatial information in each map, enabling the model to concentrate on invariant detail features. Extensive experiments on four public benchmark remote sensing datasets demonstrate the competitive performance of our proposed DCN.
Paper Structure (28 sections, 21 equations, 10 figures, 5 tables)

This paper contains 28 sections, 21 equations, 10 figures, 5 tables.

Figures (10)

  • Figure 1: Illustrations of remote sensing image characteristics: small inter-class variances and large intra-class variances. See texts for details.
  • Figure 2: Overview of the proposed DCN framework. It consists of a Context-guided Contrastive Learning (CCL) branch and a Detail-guided Contrastive Learning (DCL) branch to facilitate the model to learn more discriminative and detail information to generate context and detail features, both of which are used for training their corresponding classifiers.
  • Figure 3: Illustration of Condenser Network. With the Squeeze and Expand operations among channels, the scene-relevant information is enhanced while the irrelevant information is ignored. After the global average pooling, the context features are generated.
  • Figure 4: Illustration of Smelter Network. The helpful information in the channel and spatial dimensions is extracted respectively to enhance the significant local detail regions of the feature maps.
  • Figure 5: The inter-class variances and intra-class variances on the NWPU-RESISC45 dataset on 5-way 1-shot with baseline (ResNet), Condenser Network-integrated (ResNet+CN) and Smelter Network-integrated (ResNet+SN).
  • ...and 5 more figures