Predicting and Interpolating Spatiotemporal Environmental Data: A Case Study of Groundwater Storage in Bangladesh
Anna Pazola, Mohammad Shamsudduha, Richard G. Taylor, Allan Tucker
TL;DR
The paper tackles joint spatiotemporal estimation of environmental fields from point observations, using groundwater storage changes in Bangladesh as a case study. It systematically compares grid-to-grid, grid-to-point, and spatiotemporal deep learning approaches (including Unet variants, CNN-CNN-LSTM stacks, Kriging, and DeepKriging), across 2D and 3D configurations with temporal cross-validation. A key finding is that spatial interpolation is substantially harder than temporal prediction, with interpolation performance highly sensitive to local hydrogeology and time-series dynamics; Kriging-based interpolation and grid-based inpainting can degrade accuracy at observed points. Despite interpolation challenges, the data-driven models outperform the physics-based GLDAS 2.2 CLSM baseline, and the authors recommend exploring clustering-informed interpolation and neural-process frameworks for scalable, accurate spatiotemporal predictions in complex environments; code is available at the provided GitHub repository.
Abstract
Geospatial observational datasets are often limited to point measurements, making temporal prediction and spatial interpolation essential for constructing continuous fields. This study evaluates two deep learning strategies for addressing this challenge: (1) a grid-to-grid approach, where gridded predictors are used to model rasterised targets (aggregation before modelling), and (2) a grid-to-point approach, where gridded predictors model point targets, followed by kriging interpolation to fill the domain (aggregation after modelling). Using groundwater storage data from Bangladesh as a case study, we compare the effcacy of these approaches. Our findings indicate that spatial interpolation is substantially more difficult than temporal prediction. In particular, nearest neighbours are not always the most similar, and uncertainties in geology strongly influence point temporal behaviour. These insights motivate future work on advanced interpolation methods informed by clustering locations based on time series dynamics. Demonstrated on groundwater storage, the conclusions are applicable to other environmental variables governed by indirectly observable factors. Code is available at https://github.com/pazolka/interpolation-prediction-gwsa.
