Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification

Yimin Zhu; Linlin Xu

Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification

Yimin Zhu, Linlin Xu

TL;DR

The paper tackles hyperspectral image classification under spatial-spectral heterogeneity and noise by marrying denoising diffusion probabilistic models with contrastive learning. It introduces DiffCRN, a two-stage framework with a staged DDPM backbone featuring spatial and spectral self-attention denoising modules, an adaptive time-step sampler using pixel-level spectral angle mapping, and a fusion-enabled classifier (AWAM and CTSSFM). The approach yields strong, unsupervised-to-supervised performance across four standard HSIs, with notable gains under limited training data and robust per-class accuracy. The findings suggest diffusion-based feature learning, combined with adaptive time-step selection and cross-time-step fusion, provides a practical and scalable pathway for HSIC in real-world applications.

Abstract

Although efficient extraction of discriminative spatial-spectral features is critical for hyperspectral images classification (HSIC), it is difficult to achieve these features due to factors such as the spatial-spectral heterogeneity and noise effect. This paper presents a Spatial-Spectral Diffusion Contrastive Representation Network (DiffCRN), based on denoising diffusion probabilistic model (DDPM) combined with contrastive learning (CL) for HSIC, with the following characteristics. First,to improve spatial-spectral feature representation, instead of adopting the UNets-like structure which is widely used for DDPM, we design a novel staged architecture with spatial self-attention denoising module (SSAD) and spectral group self-attention denoising module (SGSAD) in DiffCRN with improved efficiency for spectral-spatial feature learning. Second, to improve unsupervised feature learning efficiency, we design new DDPM model with logarithmic absolute error (LAE) loss and CL that improve the loss function effectiveness and increase the instance-level and inter-class discriminability. Third, to improve feature selection, we design a learnable approach based on pixel-level spectral angle mapping (SAM) for the selection of time steps in the proposed DDPM model in an adaptive and automatic manner. Last, to improve feature integration and classification, we design an Adaptive weighted addition modul (AWAM) and Cross time step Spectral-Spatial Fusion Module (CTSSFM) to fuse time-step-wise features and perform classification. Experiments conducted on widely used four HSI datasets demonstrate the improved performance of the proposed DiffCRN over the classical backbone models and state-of-the-art GAN, transformer models and other pretrained methods. The source code and pre-trained model will be made available publicly.

Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification

TL;DR

Abstract

Spatial-Spectral Diffusion Contrastive Representation Network for Hyperspectral Image Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)