Highly Multivariate Large-scale Spatial Stochastic Processes -- A Cross-Markov Random Field Approach
Xiaoqing Chen, Peter Diggle, James V. Zidek, Gavin Shaddick
TL;DR
This work introduces a cross-Markov Random Field (cross-MRF) framework to tackle highly multivariate large-scale spatial processes by jointly learning a sparse precision matrix and an asymmetric cross-covariance structure. The approach pairs a first-stage CI-based mixed spatial graphical model with a second-stage cross-MRF that enforces doubly CI across both components and locations, achieving maximal sparsity in $\Sigma^{-1}_{np\times np}$ and reducing generation complexity to $\mathcal{O}(pn^2)$. The methodology is supported by theoretical extensions of the Hammersley-Clifford theorem to multivariate spatial processes and concrete linking strategies via moralisation and CAR-based univariate inverses. Empirical results on 1D simulations and 2D CAMS data demonstrate accurate capture of asymmetric cross-covariance, scalable computation, and interpretable neighborhood structures. The framework offers practical benefits for large-scale environmental and socio-economic spatio-temporal analyses, with clear avenues for distributed inference and spatio-temporal generalisation.
Abstract
Key challenges in the analysis of highly multivariate large-scale spatial stochastic processes, where both the number of components (p) and spatial locations (n) can be large, include achieving maximal sparsity in the joint precision matrix, ensuring efficient computational cost for its generation, accommodating asymmetric cross-covariance in the joint covariance matrix, and delivering scientific interpretability. We propose a cross-MRF model class, consisting of a mixed spatial graphical model framework and cross-MRF theory, to collectively address these challenges in one unified framework across two modelling stages. The first stage exploits scientifically informed conditional independence (CI) among p component fields and allows for a step-wise parallel generation of joint covariance and precision matrix, enabling a simultaneous accommodation of asymmetric cross-covariance in joint covariance matrix and sparsity in joint precision matrix. The second stage extends the first-stage CI to doubly CI among both p and n and unearths the cross-MRF via an extended Hammersley-Clifford theorem for multivariate spatial stochastic processes. This results in the sparsest possible representation of the joint precision matrix and ensures its lowest generation complexity. We demonstrate with 1D simulated comparative studies and 2D real-world data.
