Deep autoregressive modeling for land use land cover
Christopher Krapu, Mark Borsuk, Ryan Calder
TL;DR
The paper tackles the challenge of modeling land use / land cover (LULC) with long-range spatial dependencies by adapting a deep autoregressive, image-inpainting–style approach. It introduces PixelConstrainedCNN, a large-scale discrete autoregressive model that combines a PixelCNN prior with a ResNet-based auxiliary network to generate calibrated, conditional LULC distributions over regional patches; it is benchmarked against a SCCAR statistical baseline using NLCD 2019 data coarsened to $40\times40$ grids. Results show that the deep model captures intricate spatial patterns such as roads and water bodies and generates realistic completions, but its predictive distributions are underconfident (underdispersed) and calibration remains a challenge, with temperature tuning offering partial improvements. A Michigan state park case study demonstrates sequential infilling for larger regions and highlights the model’s potential for counterfactual landscape analysis, while pointing to limitations in long-range context and sampling efficiency. The work suggests future work to integrate additional data layers, scale to larger regions, and explore alternative architectures to improve calibration and practicality for LULC change assessment.
Abstract
Land use / land cover (LULC) modeling is a challenging task due to long-range dependencies between geographic features and distinct spatial patterns related to topography, ecology, and human development. We identify a close connection between modeling of spatial patterns of land use and the task of image inpainting from computer vision and conduct a study of a modified PixelCNN architecture with approximately 19 million parameters for modeling LULC. In comparison with a benchmark spatial statistical model, we find that the former is capable of capturing much richer spatial correlation patterns such as roads and water bodies but does not produce a calibrated predictive distribution, suggesting the need for additional tuning. We find evidence of predictive underdispersion with regard to important ecologically-relevant land use statistics such as patch count and adjacency which can be ameliorated to some extent by manipulating sampling variability.
