Hyperspectral Vision Transformers for Greenhouse Gas Estimations from Space
Ruben Gonzalez Avilés, Linus Scheibenreif, Nassim Ait Ali Braham, Benedikt Blumenstiel, Thomas Brunschwiler, Ranjini Guruprasad, Damian Borth, Conrad Albrecht, Paolo Fraccaro, Devyani Lambhate, Johannes Jakubik
TL;DR
This work tackles the trade-off between spectral resolution and spatial/temporal coverage in satellite-based greenhouse gas monitoring by introducing a spectral transformer masked autoencoder that reconstructs hyperspectral data from multispectral inputs. Pre-trained on band-wise masked hyperspectral data, the model is fine-tuned to map multispectral inputs to synthetic hyperspectral spectra, preserving spatial coherence and enabling more informative spectral signatures for downstream gas predictions. Across reconstruction and GHG-detection tasks for CH$_4$, NO$_2$, and CO$_2$, the approach yields improved spectral fidelity and often closer-to-hyperspectral performance than multispectral baselines, with notable gains in methane detection and mixed results for NO$_2$ depending on temporal alignment. The framework demonstrates the potential to extend hyperspectral-like capabilities to widespread multispectral data, enhancing atmospheric monitoring with self-supervised learning, while highlighting data-size and temporal-misalignment challenges for certain gases.
Abstract
Hyperspectral imaging provides detailed spectral information and holds significant potential for monitoring of greenhouse gases (GHGs). However, its application is constrained by limited spatial coverage and infrequent revisit times. In contrast, multispectral imaging offers broader spatial and temporal coverage but often lacks the spectral detail that can enhance GHG detection. To address these challenges, this study proposes a spectral transformer model that synthesizes hyperspectral data from multispectral inputs. The model is pre-trained via a band-wise masked autoencoder and subsequently fine-tuned on spatio-temporally aligned multispectral-hyperspectral image pairs. The resulting synthetic hyperspectral data retain the spatial and temporal benefits of multispectral imagery and improve GHG prediction accuracy relative to using multispectral data alone. This approach effectively bridges the trade-off between spectral resolution and coverage, highlighting its potential to advance atmospheric monitoring by combining the strengths of hyperspectral and multispectral systems with self-supervised deep learning.
