Spatial Bayesian Neural Networks
Andrew Zammit-Mangion, Michael D. Kaminski, Ba-Hien Tran, Maurizio Filippone, Noel Cressie
TL;DR
The paper introduces Spatial Bayesian Neural Networks (SBNNs), a flexible, calibration-based approach for modelling spatial processes by embedding spatial structure and allowing parameter variation across space. SBNNs are calibrated to a target process using Wasserstein-1 distance, via a two-stage optimization that updates a differentiable surrogate for the objective and then the hyperparameters of the network, enabling accurate replication of a diverse set of spatial processes, including Gaussian, non-Gaussian, and max-stable models. Through simulation studies and case studies, the authors demonstrate that SBNNs with embedding layers and spatially varying parameters outperform vanilla BNNs and can serve as efficient surrogates for complex stochastic processes, while highlighting computational demands and interpretability limitations. The framework provides a versatile tool for spatial prediction and uncertainty quantification, with potential applicability to a wide range of replicated-realization settings and stochastic simulators, albeit not without open questions about predictive inference with spatially varying parameters and covariate integration.
Abstract
interpretable, and well understood models that are routinely employed even though, as is revealed through prior and posterior predictive checks, these can poorly characterise the spatial heterogeneity in the underlying process of interest. Here, we propose a new, flexible class of spatial-process models, which we refer to as spatial Bayesian neural networks (SBNNs). An SBNN leverages the representational capacity of a Bayesian neural network; it is tailored to a spatial setting by incorporating a spatial ``embedding layer'' into the network and, possibly, spatially-varying network parameters. An SBNN is calibrated by matching its finite-dimensional distribution at locations on a fine gridding of space to that of a target process of interest. That process could be easy to simulate from or we may have many realisations from it. We propose several variants of SBNNs, most of which are able to match the finite-dimensional distribution of the target process at the selected grid better than conventional BNNs of similar complexity. We also show that an SBNN can be used to represent a variety of spatial processes often used in practice, such as Gaussian processes, lognormal processes, and max-stable processes. We briefly discuss the tools that could be used to make inference with SBNNs, and we conclude with a discussion of their advantages and limitations.
