A Generalized Unified Skew-Normal Process with Neural Bayes Inference
Kesen Wang, Marc G. Genton
TL;DR
This work tackles non-Gaussian spatial data exhibiting skewness and heavy tails by introducing the Generalized Unified Skew-Normal (GSUN) process, a flexible extension of SUN that yields vanishing correlations at large distances and a tractable kriging framework. A concise SUN re-parameterization with a diagonal skewness structure improves numerical stability and interpretability, while the GSUN construction integrates latent skewness from both observed and latent processes via two independent Gaussian components. To enable scalable inference in high dimensions, the authors develop neural Bayes estimators based on Graph Attention Networks and an encoder transformer, trained through amortized simulation under uniform priors, and augmented with dropout and on-the-fly data generation. The approach is validated through extensive simulations, uncertainty quantification, and an application to Pb-contaminated soils, showing favorable PIT behavior and improved fit over Gaussian and Tukey g-and-h models, with SUGLG serving as a baseline competitor. Overall, the combination of a flexible GSUN spatial model and neural Bayes inference offers a principled, scalable framework for non-Gaussian spatial analysis with practical interpolation capabilities.
Abstract
In recent decades, statisticians have increasingly encountered spatial data that exhibit non-Gaussian behaviors such as asymmetry and heavy-tailedness. As a result, the assumptions of symmetry and fixed tail weight in Gaussian processes have become restrictive and may fail to capture the intrinsic properties of the data. To address the limitations of Gaussian models, a variety of skewed models have been proposed, and their popularity has grown rapidly. These skewed models introduce parameters that govern skewness and tail weight. Among the various proposals in the literature, unified skewed distributions, such as the Unified Skew-Normal (SUN), have received considerable attention. In this work, we revisit a more concise and interpretable re-parameterization of the SUN distribution and apply the distribution to random fields by constructing a generalized unified skew-normal (GSUN) spatial process. We demonstrate that the GSUN is a valid spatial process by showing that its correlation vanishes at large distances, and we provide the corresponding spatial interpolation method. In addition, we develop an inference mechanism for the GSUN process using neural Bayes estimators built on deep graph attention networks (GATs) and an encoder transformer. Through a simulation study and an application to Pb-contaminated soil data, we show that the proposed estimator is more stable and accurate than conventional CNN-based architectures. Furthermore, we show through the probability integral transform (PIT) that the GSUN process differs from conventional Gaussian processes and Tukey g-and-h processes.
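As a rough illustration of the PIT diagnostic mentioned above, the following sketch checks how far PIT values fall from uniformity when a Gaussian model is fitted to skewed data. It is only a toy under stated assumptions: the data are univariate draws from Azzalini's skew-normal (a simple stand-in, not the GSUN process), and the function names `gaussian_pit` and `ks_distance_to_uniform` as well as the value of `delta` are hypothetical choices made for this example.

```python
import math
import numpy as np

def gaussian_pit(y):
    """PIT values of y under a Gaussian fitted by sample mean and std."""
    z = (y - y.mean()) / y.std(ddof=1)
    # Standard normal CDF evaluated pointwise via the error function.
    return np.array([0.5 * (1.0 + math.erf(v / math.sqrt(2.0))) for v in z])

def ks_distance_to_uniform(u):
    """Kolmogorov-Smirnov distance between the empirical CDF of u and U(0, 1)."""
    u = np.sort(u)
    n = u.size
    upper = np.arange(1, n + 1) / n - u
    lower = u - np.arange(0, n) / n
    return float(max(upper.max(), lower.max()))

rng = np.random.default_rng(0)
n = 5000
delta = 0.95  # hypothetical skewness level, chosen only for illustration

# Univariate skew-normal draws: delta * |U0| + sqrt(1 - delta^2) * U1.
u0 = rng.standard_normal(n)
u1 = rng.standard_normal(n)
skewed = delta * np.abs(u0) + math.sqrt(1.0 - delta**2) * u1
gaussian = rng.standard_normal(n)

# A well-specified model gives near-uniform PIT values (small distance);
# a Gaussian fitted to skewed data gives a visibly larger distance.
ks_skewed = ks_distance_to_uniform(gaussian_pit(skewed))
ks_gaussian = ks_distance_to_uniform(gaussian_pit(gaussian))
print(f"KS distance, skewed data:   {ks_skewed:.4f}")
print(f"KS distance, Gaussian data: {ks_gaussian:.4f}")
```

In practice the PIT values would be computed from the predictive distribution of each candidate spatial model at held-out locations; the marginal fit here only conveys the idea that a misspecified model produces non-uniform PIT values.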
