The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation
Yanis Benidir, Nicolas Gonthier, Clement Mallet
TL;DR
This work tackles the scarcity and limited transferability of bi-temporal semantic change detection (SCD) data for very high resolution Earth observation. It introduces HySCDG, a diffusion-based pipeline that semantically guides inpainting to transform real VHR images into paired bi-temporal samples, producing FSC-180k, a large hybrid SCD pretraining dataset built on the FLAIR land-cover map. Pretraining with FSC-180k yields consistent improvements across five target datasets for both binary and semantic changes, outperforming fully synthetic baselines (e.g., SyntheWorld) in most configurations and enabling effective transfer in zero-shot, low-data, and mixed-training scenarios. The approach demonstrates robust cross-domain transfer and offers practical gains for scalable Earth observation change monitoring, with the dataset and tools publicly available for reuse.
Abstract
Bi-temporal change detection at scale based on Very High Resolution (VHR) images is crucial for Earth monitoring. This remains poorly addressed so far: methods either require large volumes of annotated data (semantic case), or are limited to restricted datasets (binary set-ups). Most approaches do not exhibit the versatility required for temporal and spatial adaptation: simplicity in architecture design and pretraining on realistic and comprehensive datasets. Synthetic datasets are the key solution but still fail to handle complex and diverse scenes. In this paper, we present HySCDG a generative pipeline for creating a large hybrid semantic change detection dataset that contains both real VHR images and inpainted ones, along with land cover semantic map at both dates and the change map. Being semantically and spatially guided, HySCDG generates realistic images, leading to a comprehensive and hybrid transfer-proof dataset FSC-180k. We evaluate FSC-180k on five change detection cases (binary and semantic), from zero-shot to mixed and sequential training, and also under low data regime training. Experiments demonstrate that pretraining on our hybrid dataset leads to a significant performance boost, outperforming SyntheWorld, a fully synthetic dataset, in every configuration. All codes, models, and data are available here: https://yb23.github.io/projects/cywd/
