Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken
Peifu Liu, Tingfa Xu, Jie Wang, Huan Chen, Huiyan Bai, Jianan Li
TL;DR
This work tackles boundary preservation and regional consistency in hyperspectral image classification by introducing the Dual-stage Spectral Supertoken Classifier (DSTC). DSTC first clusters spectrally similar pixels into spectral supertokens using spectrum-derivative features, then classifies these tokens with a Transformer and projects the results back to pixel space, guided by class-proportion-based soft labels $\boldsymbol{L} \in \mathbb{R}^{M\times C'}$ and token predictions $\hat{\boldsymbol{S}} \in \mathbb{R}^{M\times C'}$. Core contributions include the spectrum-derivative-based pixel clustering, semantic feature aggregation forming $\boldsymbol{S} \in \mathbb{R}^{M\times C_2}$, the CPSL supervision, and an end-to-end trainable two-stage pipeline, demonstrated on WHU-OHS, IP, KSC, UP with efficiency advantages (e.g., reduced FLOPs) and strong accuracy. The approach yields improved boundary delineation and regional coherence while enabling near real-time inference, with generalization validated by HS-SOD experiments and public code release.
Abstract
Hyperspectral image classification, a task that assigns pre-defined classes to each pixel in a hyperspectral image of remote sensing scenes, often faces challenges due to the neglect of correlations between spectrally similar pixels. This oversight can lead to inaccurate edge definitions and difficulties in managing minor spectral variations in contiguous areas. To address these issues, we introduce the novel Dual-stage Spectral Supertoken Classifier (DSTC), inspired by superpixel concepts. DSTC employs spectrum-derivative-based pixel clustering to group pixels with similar spectral characteristics into spectral supertokens. By projecting the classification of these tokens onto the image space, we achieve pixel-level results that maintain regional classification consistency and precise boundary. Moreover, recognizing the diversity within tokens, we propose a class-proportion-based soft label. This label adaptively assigns weights to different categories based on their prevalence, effectively managing data distribution imbalances and enhancing classification performance. Comprehensive experiments on WHU-OHS, IP, KSC, and UP datasets corroborate the robust classification capabilities of DSTC and the effectiveness of its individual components. Code will be publicly available at https://github.com/laprf/DSTC.
