S4DL: Shift-sensitive Spatial-Spectral Disentangling Learning for Hyperspectral Image Unsupervised Domain Adaptation
Jie Feng, Tianshu Zhang, Junpeng Zhang, Ronghua Shang, Weisheng Dong, Guangming Shi, Licheng Jiao
TL;DR
This work tackles cross-scene hyperspectral image unsupervised domain adaptation by addressing spectral-channel shifts that traditional alignment methods overlook. It introduces S4DL, a framework combining gradient-guided spatial-spectral disentangling (GSSD), a shift-sensitive adaptive monitor (SSAM), and a reversible feature extractor (RFE) to separate domain-invariant from domain-specific channel information while preserving low-level details. The three components work together under a loss L_total = L_cls + λ_1 L_ortho + λ_2 L_dom to improve transferability across diverse scenes, as demonstrated on Houston, HyRANK, and S-H datasets where S4DL achieves state-of-the-art OA and Kappa. The approach offers a practical impact by enhancing cross-scene HSI classification with robust, computation-efficient domain adaptation that explicitly leverages spectral information for disentangling domain shifts.
Abstract
Unsupervised domain adaptation techniques, extensively studied in hyperspectral image (HSI) classification, aim to use labeled source domain data and unlabeled target domain data to learn domain invariant features for cross-scene classification. Compared to natural images, numerous spectral bands of HSIs provide abundant semantic information, but they also increase the domain shift significantly. In most existing methods, both explicit alignment and implicit alignment simply align feature distribution, ignoring domain information in the spectrum. We noted that when the spectral channel between source and target domains is distinguished obviously, the transfer performance of these methods tends to deteriorate. Additionally, their performance fluctuates greatly owing to the varying domain shifts across various datasets. To address these problems, a novel shift-sensitive spatial-spectral disentangling learning (S4DL) approach is proposed. In S4DL, gradient-guided spatial-spectral decomposition is designed to separate domain-specific and domain-invariant representations by generating tailored masks under the guidance of the gradient from domain classification. A shift-sensitive adaptive monitor is defined to adjust the intensity of disentangling according to the magnitude of domain shift. Furthermore, a reversible neural network is constructed to retain domain information that lies in not only in semantic but also the shallow-level detailed information. Extensive experimental results on several cross-scene HSI datasets consistently verified that S4DL is better than the state-of-the-art UDA methods. Our source code will be available at https://github.com/xdu-jjgs/S4DL.
