HySpecNet-11k: A Large-Scale Hyperspectral Dataset for Benchmarking Learning-Based Hyperspectral Image Compression Methods
Martin Hermann Paul Fuchs, Begüm Demir
TL;DR
The paper tackles the data bottleneck in learning-based hyperspectral image compression by introducing HySpecNet-11k, a large-scale, unlabeled benchmark dataset of $11{,}483$ patches, each $128\times128$ with $202$ spectral bands, derived from EnMAP imagery. It benchmarks state-of-the-art convolutional autoencoders across spectral- and spatial-domain compression, revealing that spectral-domain compression via a 1D-CAE delivers the strongest rate-distortion performance (e.g., $PSNR$ ≈ $54.85$ dB at $CR$ ≈ $8.08$), while methods employing spatial downsampling lag in reconstruction quality. The dataset provides reproducible splits (patchwise easy and tilewise hard) and accompanying code, enabling robust evaluation of unsupervised learning on hyperspectral data. HySpecNet-11k is poised to advance research in hyperspectral learning by alleviating data scarcity and supporting generalization across diverse geographic regions, with plans to extend the dataset through additional EnMAP tiles.
Abstract
The development of learning-based hyperspectral image compression methods has recently attracted great attention in remote sensing. Such methods require a high number of hyperspectral images to be used during training to optimize all parameters and reach a high compression performance. However, existing hyperspectral datasets are not sufficient to train and evaluate learning-based compression methods, which hinders the research in this field. To address this problem, in this paper we present HySpecNet-11k that is a large-scale hyperspectral benchmark dataset made up of 11,483 nonoverlapping image patches. Each patch is a portion of 128 $\times$ 128 pixels with 224 spectral bands and a ground sample distance of 30 m. We exploit HySpecNet-11k to benchmark the current state of the art in learning-based hyperspectral image compression by focussing our attention on various 1D, 2D and 3D convolutional autoencoder architectures. Nevertheless, HySpecNet-11k can be used for any unsupervised learning task in the framework of hyperspectral image analysis. The dataset, our code and the pre-trained weights are publicly available at https://hyspecnet.rsim.berlin
