In-domain representation learning for remote sensing
Maxim Neumann, Andre Susano Pinto, Xiaohua Zhai, Neil Houlsby
TL;DR
Remote sensing has lagged in representation learning; the authors provide five standardized datasets and a common evaluation protocol to study in-domain RS representations. They train RS-specific representations via supervised fine-tuning on in-domain data and show this approach yields state-of-the-art transfer performance across unseen RS tasks, especially with limited downstream data. Their results reveal that multi-resolution and dataset quality factors (label accuracy, class diversity) influence the quality of learned representations, and that large weakly labeled RS datasets do not always outperform smaller, curated datasets. They also release the trained representations and code via TensorFlow Hub/TFDS to enable rapid reuse.
Abstract
Given the importance of remote sensing, surprisingly little attention has been paid to it by the representation learning community. To address it and to establish baselines and a common evaluation protocol in this domain, we provide simplified access to 5 diverse remote sensing datasets in a standardized form. Specifically, we investigate in-domain representation learning to develop generic remote sensing representations and explore which characteristics are important for a dataset to be a good source for remote sensing representation learning. The established baselines achieve state-of-the-art performance on these datasets.
