Neural networks for geospatial data

Wentao Zhan; Abhirup Datta

Neural networks for geospatial data

Wentao Zhan, Abhirup Datta

TL;DR

This work advances geostatistical modeling by integrating neural networks into Gaussian process mean functions while preserving explicit spatial covariance via generalized least squares (GLS) loss. The NN-GLS algorithm is shown to be realizable as a two-layer graph neural network using nearest-neighbor Gaussian process (NNGP) structure, enabling scalable mini-batching, backpropagation, and kriging. The authors provide rigorous theory for existence, consistency under irregular spatial designs, and finite-sample error rates that quantify the advantage of incorporating accurate spatial covariance over vanilla NN with independence assumptions. Simulations and a PM$_{2.5}$ case study demonstrate that NN-GLS yields superior estimation and prediction performance, reliable uncertainty quantification via spatial bootstrap, and interpretable covariate effects through the nonlinear mean function. Overall, NN-GLS offers a principled, scalable framework for nonparametric mean modeling in spatial settings, with broad applicability to irregular geospatial data and complex covariate interactions.

Abstract

Analysis of geospatial data has traditionally been model-based, with a mean model, customarily specified as a linear regression on the covariates, and a covariance model, encoding the spatial dependence. We relax the strong assumption of linearity and propose embedding neural networks directly within the traditional geostatistical models to accommodate non-linear mean functions while retaining all other advantages including use of Gaussian Processes to explicitly model the spatial covariance, enabling inference on the covariate effect through the mean and on the spatial dependence through the covariance, and offering predictions at new locations via kriging. We propose NN-GLS, a new neural network estimation algorithm for the non-linear mean in GP models that explicitly accounts for the spatial covariance through generalized least squares (GLS), the same loss used in the linear case. We show that NN-GLS admits a representation as a special type of graph neural network (GNN). This connection facilitates use of standard neural network computational techniques for irregular geospatial data, enabling novel and scalable mini-batching, backpropagation, and kriging schemes. Theoretically, we show that NN-GLS will be consistent for irregularly observed spatially correlated data processes. We also provide a finite sample concentration rate, which quantifies the need to accurately model the spatial covariance in neural networks for dependent data. To our knowledge, these are the first large-sample results for any neural network algorithm for irregular spatial data. We demonstrate the methodology through simulated and real datasets.

Neural networks for geospatial data

TL;DR

case study demonstrate that NN-GLS yields superior estimation and prediction performance, reliable uncertainty quantification via spatial bootstrap, and interpretable covariate effects through the nonlinear mean function. Overall, NN-GLS offers a principled, scalable framework for nonparametric mean modeling in spatial settings, with broad applicability to irregular geospatial data and complex covariate interactions.

Abstract

Paper Structure (54 sections, 17 theorems, 128 equations, 32 figures, 1 table, 1 algorithm)

This paper contains 54 sections, 17 theorems, 128 equations, 32 figures, 1 table, 1 algorithm.

Introduction
Preliminaries
Process-based geostatistical modeling
Neural networks
Neural networks for Gaussian process models
NN-GLS: Neural networks using GLS loss
NN-GLS as Graph Neural Network
Mini-batching
Back-propagation
Kriging
Theory
Notations and Assumptions
Main results
Matérn Gaussian processes
Finite-sample error rates
...and 39 more sections

Key Result

Theorem 1

Given data $(Y_i, \bm{X}_i, s_i), i = 1, \cdots, n$ generated from (model-sp-nonlinear) under Assumption Asmp-1, and a working precision matrix $\mathbf{Q}$satisfying Assumption Asmp-3, with the function classes $\mathcal{F}_n$ defined in (def-F_n), there exists a seive estimator $\widehat{f}_n$ suc

Figures (32)

Figure 1: NN-GLS as a graph neural network with two graph convolution layers
Figure 2: (a): The estimation performance comparison with $f_0 = f_2$; (b): (Section \ref{['sec-sim-friedman']}) Estimation performance comparison among GAM, GAMGLS and NN-GLS against $rho$, i.e. the interaction strength; (c): The prediction performance comparison with $f_0 = f_2$; (d): (Section \ref{['sec-sim-NN']}) Prediction performance comparison among non-spatial NN, NN-GLS and NN-splines against sample size; (e): (Section \ref{['sec-sim-large']}) The consistency of estimation. (f): (Section \ref{['sec-sim-large']}) The running time for estimation.
Figure 3: PM$_{2.5}$ data analysis.
Figure 4: Partial dependency plots showing the marginal effects of temperature and west wind (U-wind).
Figure S5: Parameter estimations in NN-GLS. We denote NN-nonspatial by NN here to save space for axis text.
...and 27 more figures

Theorems & Definitions (20)

Theorem 1: Existence of seive estimator
Theorem 2: Consistency
Proposition 1
Proposition 2
Theorem 3: Convergence rate
Theorem S1: Theorem 2.2 in white1991some
Lemma S1
Lemma S2
Definition S4.1: Orlicz norm
Proposition S1
...and 10 more

Neural networks for geospatial data

TL;DR

Abstract

Neural networks for geospatial data

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (32)

Theorems & Definitions (20)