Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series
Aimi Okabayashi, Nicolas Audebert, Simon Donike, Charlotte Pelletier
TL;DR
The paper tackles reconstructing high-resolution Sentinel-2 time-series imagery by cross-sensor MISR using SPOT-6 as the HR reference. It extends diffusion-based SR (SRDiff) with MISR backbones and introduces a time-equivariant fusion module (L-TAE) to handle irregular time sampling, enabling HR outputs at any target date within the LR sequence. A new BreizhSR dataset (Breizh region) of 52,255 S2–SPOT-6 pairs enables real-world evaluation at 4× resolution. Results show MISR with time-aware fusion improves both pixel- and perceptual-based metrics, though a trade-off between spectral fidelity and perceptual quality emerges, suggesting future work on joint losses and full-spectrum EO SR.
Abstract
Satellite imaging generally presents a trade-off between the frequency of acquisitions and the spatial resolution of the images. Super-resolution is often advanced as a way to get the best of both worlds. In this work, we investigate multi-image super-resolution of satellite image time series, i.e. how multiple images of the same area acquired at different dates can help reconstruct a higher resolution observation. In particular, we extend state-of-the-art deep single and multi-image super-resolution algorithms, such as SRDiff and HighRes-net, to deal with irregularly sampled Sentinel-2 time series. We introduce BreizhSR, a new dataset for 4x super-resolution of Sentinel-2 time series using very high-resolution SPOT-6 imagery of Brittany, a French region. We show that using multiple images significantly improves super-resolution performance, and that a well-designed temporal positional encoding allows us to perform super-resolution for different times of the series. In addition, we observe a trade-off between spectral fidelity and perceptual quality of the reconstructed HR images, questioning future directions for super-resolution of Earth Observation data.
