Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift
Elliot Vincent, Jean Ponce, Mathieu Aubry
TL;DR
This work tackles semantic change detection over satellite image time series (SITS-SCD) by introducing a multi-temporal architecture that leverages long-term temporal information through a novel temporal attention mechanism, outputting a segmentation map for each timestamp. The method outperforms mono- and bi-temporal baselines on DynamicEarthNet and MUDS while scaling more efficiently with model size, though gains do not always translate to change-detection accuracy. A systematic analysis of temporal and spatial domain shifts reveals that spatial shifts cause the largest performance drops, and temporal shifts mainly degrade change-detection performance while having a smaller effect on semantic segmentation. The study highlights data scarcity and the rarity of meaningful changes as key practical limits, underscoring the need for robust domain adaptation and more diverse annotated SITS data for reliable global monitoring. Overall, the paper advances SITS-SCD by combining long-range temporal reasoning with a principled assessment of domain shift impacts, informing future research and practical deployment in Earth observation workflows.
Abstract
Satellite imagery plays a crucial role in monitoring changes happening on Earth's surface and aiding in climate analysis, ecosystem assessment, and disaster response. In this paper, we tackle semantic change detection with satellite image time series (SITS-SCD) which encompasses both change detection and semantic segmentation tasks. We propose a new architecture that improves over the state of the art, scales better with the number of parameters, and leverages long-term temporal information. However, for practical use cases, models need to adapt to spatial and temporal shifts, which remains a challenge. We investigate the impact of temporal and spatial shifts separately on global, multi-year SITS datasets using DynamicEarthNet and MUDS. We show that the spatial domain shift represents the most complex setting and that the impact of temporal shift on performance is more pronounced on change detection than on semantic segmentation, highlighting that it is a specific issue deserving further attention.
