GST-UNet: A Neural Framework for Spatiotemporal Causal Inference with Time-Varying Confounding
Miruna Oprescu, David K. Park, Xihaier Luo, Shinjae Yoo, Nathan Kallus
TL;DR
GST-UNet addresses causal inference in spatiotemporal data with time-varying confounding and interference, enabling estimation of location-specific potential outcomes from a single observed trajectory. It fuses a U-Net–ConvLSTM spatiotemporal encoder with iterative G-computation heads, under a representation-based time-invariance embedding and a curriculum-based training regime to stabilize learning of recursive pseudo-outcomes. The approach provides identification and consistency guarantees and is validated on synthetic data and a real-world Camp Fire health analysis, showing superior accuracy and stable counterfactual estimates compared to baselines. This yields a principled, ready-to-use tool for policy-relevant and scientific studies involving complex spatiotemporal causal effects. The combination of theory, architecture, and empirical results advances reliable spatiotemporal causal inference in domains like public health and environmental policy.
Abstract
Estimating causal effects from spatiotemporal observational data is essential in public health, environmental science, and policy evaluation, where randomized experiments are often infeasible. Existing approaches, however, either rely on strong structural assumptions or fail to handle key challenges such as interference, spatial confounding, temporal carryover, and time-varying confounding -- where covariates are influenced by past treatments and, in turn, affect future ones. We introduce GST-UNet (G-computation Spatio-Temporal UNet), a theoretically grounded neural framework that combines a U-Net-based spatiotemporal encoder with regression-based iterative G-computation to estimate location-specific potential outcomes under complex intervention sequences. GST-UNet explicitly adjusts for time-varying confounders and captures non-linear spatial and temporal dependencies, enabling valid causal inference from a single observed trajectory in data-scarce settings. We validate its effectiveness in synthetic experiments and in a real-world analysis of wildfire smoke exposure and respiratory hospitalizations during the 2018 California Camp Fire. Together, these results position GST-UNet as a principled and ready-to-use framework for spatiotemporal causal inference, advancing reliable estimation in policy-relevant and scientific domains.
