Regression modelling of spatiotemporal extreme U.S. wildfires via partially-interpretable neural networks
Jordan Richards, Raphaël Huser
TL;DR
This work addresses the challenge of estimating and interpreting extreme-value quantiles for spatiotemporal environmental processes, focusing on U.S. wildfires. It introduces partially-interpretable neural networks (PINNs) that combine interpretable linear/spline components with neural-network-based nonparametric parts to model extreme-value regression within a point-process framework. The key contributions include the bGEV-PP model, a convolutional PINN architecture for tails, and a thorough data-driven analysis showing improved tail predictions and interpretable drivers of wildfire extremes, along with hazard maps and temporal trends. The framework is scalable to high-dimensional predictors and provides actionable insights into where and when extreme wildfires are most likely and how meteorological drivers shape their severity.
Abstract
Risk management in many environmental settings requires an understanding of the mechanisms that drive extreme events. Useful metrics for quantifying such risk are extreme quantiles of response variables conditioned on predictor variables that describe, e.g., climate, biosphere and environmental states. Typically these quantiles lie outside the range of observable data and so, for estimation, require specification of parametric extreme value models within a regression framework. Classical approaches in this context utilise linear or additive relationships between predictor and response variables and suffer in either their predictive capabilities or computational efficiency; moreover, their simplicity is unlikely to capture the truly complex structures that lead to the creation of extreme wildfires. In this paper, we propose a new methodological framework for performing extreme quantile regression using artificial neutral networks, which are able to capture complex non-linear relationships and scale well to high-dimensional data. The "black box" nature of neural networks means that they lack the desirable trait of interpretability often favoured by practitioners; thus, we unify linear, and additive, regression methodology with deep learning to create partially-interpretable neural networks that can be used for statistical inference but retain high prediction accuracy. To complement this methodology, we further propose a novel point process model for extreme values which overcomes the finite lower-endpoint problem associated with the generalised extreme value class of distributions. Efficacy of our unified framework is illustrated on U.S. wildfire data with a high-dimensional predictor set and we illustrate vast improvements in predictive performance over linear and spline-based regression techniques.
