GeOT: A spatially explicit framework for evaluating spatio-temporal predictions
Nina Wiedemann, Théo Uscidda, Martin Raubal
TL;DR
GeOT presents a spatially explicit framework for evaluating spatio-temporal predictions by leveraging Optimal Transport to quantify relocation costs between predicted and ground-truth spatial distributions. By defining a cost matrix over locations, GeOT computes $W^{geo}_c$ (and its partial version $W^{geo}_{c,\phi}$) to reflect real-world operational costs, and shows how Sinkhorn-based OT losses can be used to train models with spatial awareness. Validation on synthetic data links OT to spatial autocorrelation measures like Moran's I, while case studies on bike sharing, charging, and traffic demonstrate improved alignment of predictions with spatially distributed costs and reveal scale- and application-dependent trade-offs. The framework offers a flexible, interpretable metric that can guide model selection, aggregation level, and loss design, with code available for reproducibility. Overall, GeOT advances GeoAI by integrating spatial cost considerations directly into evaluation and training, enabling more cost-aware and spatially coherent predictions.
Abstract
When predicting observations across space and time, the spatial layout of errors impacts a model's real-world utility. For instance, in bike sharing demand prediction, error patterns translate to relocation costs. However, commonly used error metrics in GeoAI evaluate predictions point-wise, neglecting effects such as spatial heterogeneity, autocorrelation, and the Modifiable Areal Unit Problem. We put forward Optimal Transport (OT) as a spatial evaluation metric and loss function. The proposed framework, called GeOT, assesses the performance of prediction models by quantifying the transport costs associated with their prediction errors. Through experiments on real and synthetic data, we demonstrate that 1) the spatial distribution of prediction errors relates to real-world costs in many applications, 2) OT captures these spatial costs more accurately than existing metrics, and 3) OT enhances comparability across spatial and temporal scales. Finally, we advocate for leveraging OT as a loss function in neural networks to improve the spatial accuracy of predictions. Experiments with bike sharing, charging station, and traffic datasets show that spatial costs are significantly reduced with only marginal changes to non-spatial error metrics. Thus, this approach not only offers a spatially explicit tool for model evaluation and selection, but also integrates spatial considerations into model training. All code is available at https://github.com/mie-lab/geospatialOT.
