Transformer for Multitemporal Hyperspectral Image Unmixing
Hang Li, Qiankun Dong, Xueshuo Xie, Xia Xu, Tao Li, Zhenwei Shi
TL;DR
This work tackles multitemporal hyperspectral image unmixing (MTHU) by introducing MUFormer, an end-to-end unsupervised transformer framework. It uses a CNN encoder followed by a Global Awareness Module (GAM) to capture global temporal–spatial–spectral dependencies and a Change Enhancement Module (CEM) to model fine-grained changes between adjacent time phases, with a phase-wise linear decoder to estimate endmembers. The model is trained with a composite loss $L = \beta L_{RE} + \gamma L_{SAD} + \lambda L_E$, balancing reconstruction quality, spectral fidelity, and endmember consistency. Experiments on a real Lake Tahoe sequence and two synthetic datasets show state-of-the-art performance in abundance and endmember estimation, demonstrating strong potential for robust multitemporal unmixing and long-term surface monitoring, while hinting at future directions for denoising and more subtle change handling.
Abstract
Multitemporal hyperspectral image unmixing (MTHU) holds significant importance in monitoring and analyzing the dynamic changes of surface. However, compared to single-temporal unmixing, the multitemporal approach demands comprehensive consideration of information across different phases, rendering it a greater challenge. To address this challenge, we propose the Multitemporal Hyperspectral Image Unmixing Transformer (MUFormer), an end-to-end unsupervised deep learning model. To effectively perform multitemporal hyperspectral image unmixing, we introduce two key modules: the Global Awareness Module (GAM) and the Change Enhancement Module (CEM). The Global Awareness Module computes self-attention across all phases, facilitating global weight allocation. On the other hand, the Change Enhancement Module dynamically learns local temporal changes by comparing endmember changes between adjacent phases. The synergy between these modules allows for capturing semantic information regarding endmember and abundance changes, thereby enhancing the effectiveness of multitemporal hyperspectral image unmixing. We conducted experiments on one real dataset and two synthetic datasets, demonstrating that our model significantly enhances the effect of multitemporal hyperspectral image unmixing.
