Gold Exploration using Representations from a Multispectral Autoencoder
Argyro Tsandalidou, Konstantinos Dogeas, Eleftheria Tetoula Tsonga, Elisavet Parselia, Georgios Tsimiklis, George Arvanitakis
TL;DR
The paper addresses gold prospectivity mapping from satellite imagery under limited labeled data by leveraging representations learned from unlabeled multispectral Sentinel-2 data. It introduces Isometric, a frozen Masked Autoencoder-based foundation model trained on the large FalconSpace-S2v1.0 dataset to produce spectral-spatial embeddings, which feed a lightweight XGBoost classifier for gold vs non-gold detection. Compared to using raw multispectral inputs and to the state-of-the-art SpectralGPT, the representation-based approach yields notable gains in both patch-level and image-level accuracy, evidencing robust transfer of mineralogical patterns. The findings demonstrate the potential of foundation-model representations to enable scalable, globally applicable mineral exploration from space, with the encoder acting as a reusable feature extractor for limited-label tasks; future work envisions larger datasets and data fusion with SAR and hyperspectral modalities plus multi-temporal analysis.
Abstract
Satellite imagery is employed for large-scale prospectivity mapping due to the high cost and typically limited availability of on-site mineral exploration data. In this work, we present a proof-of-concept framework that leverages generative representations learned from multispectral Sentinel-2 imagery to identify gold-bearing regions from space. An autoencoder foundation model, called Isometric, which is pretrained on the large-scale FalconSpace-S2 v1.0 dataset, produces information-dense spectral-spatial representations that serve as inputs to a lightweight XGBoost classifier. We compare this representation-based approach with a raw spectral input baseline using a dataset of 63 Sentinel-2 images from known gold and non-gold locations. The proposed method improves patch-level accuracy from 0.51 to 0.68 and image-level accuracy from 0.55 to 0.73, demonstrating that generative embeddings capture transferable mineralogical patterns even with limited labeled data. These results highlight the potential of foundation-model representations to make mineral exploration more efficient, scalable, and globally applicable.
