SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation
Yi Wang, Nassim Ait Ali Braham, Zhitong Xiong, Chenying Liu, Conrad M Albrecht, Xiao Xiang Zhu
TL;DR
SSL4EO-S12 provides a scalable, multimodal, multi-temporal unlabeled dataset for self-supervised learning in Earth observation, leveraging Sentinel-1/2 data to enable in-domain pre-training. The authors benchmark MoCo-v2, DINO, MAE, and data2vec, showing consistent improvements over supervised baselines and existing RS pre-training datasets across classification, segmentation, and change-detection tasks. Ablation studies highlight the value of multimodality, seasonal information, atmospheric correction augmentation, and scaling, with t-SNE visualizations confirming meaningful, class-oriented representations. The dataset and accompanying code are openly available, offering a substantial resource for advancing SSL in EO and facilitating transfer to diverse RS applications. Overall, SSL4EO-S12 demonstrates that large-scale, EO-specific SSL can achieve competitive downstream performance and generalize across multiple RS benchmarks.
Abstract
Self-supervised pre-training bears potential to generate expressive representations without human annotation. Most pre-training in Earth observation (EO) are based on ImageNet or medium-size, labeled remote sensing (RS) datasets. We share an unlabeled RS dataset SSL4EO-S12 (Self-Supervised Learning for Earth Observation - Sentinel-1/2) to assemble a large-scale, global, multimodal, and multi-seasonal corpus of satellite imagery from the ESA Sentinel-1 \& -2 satellite missions. For EO applications we demonstrate SSL4EO-S12 to succeed in self-supervised pre-training for a set of methods: MoCo-v2, DINO, MAE, and data2vec. Resulting models yield downstream performance close to, or surpassing accuracy measures of supervised learning. In addition, pre-training on SSL4EO-S12 excels compared to existing datasets. We make openly available the dataset, related source code, and pre-trained models at https://github.com/zhu-xlab/SSL4EO-S12.
