Deep Gaussian Markov Random Fields for Graph-Structured Dynamical Systems
Fiona Lippert, Bart Kranstauber, E. Emiel van Loon, Patrick Forré
TL;DR
This work extends Deep Gaussian Markov Random Fields to spatiotemporal graph-structured systems (ST-DGMRF) by formulating graph-structured dynamics as a joint space-time GMRF defined via simple spatial and temporal layers. It enables efficient variational learning from limited data and exact posterior inference using conjugate gradients, with linear scaling in the number of time steps and state dimensions under $\mathcal{O}(KN d_{\text{spatial}} L_{\text{spatial}} + KN d_{\text{temporal}} L_{\text{temporal}})$. Empirical results on synthetic advection-diffusion and real-world air quality data show ST-DGMRF outperforms baselines that rely on simplified dependencies, providing more accurate state estimates and well-calibrated uncertainty, even with substantial missing data. The approach combines principled Bayesian inference with scalable deep-layer parameterizations, offering a data-efficient and tractable framework for high-dimensional spatiotemporal inference with partially unknown dynamics.
Abstract
Probabilistic inference in high-dimensional state-space models is computationally challenging. For many spatiotemporal systems, however, prior knowledge about the dependency structure of state variables is available. We leverage this structure to develop a computationally efficient approach to state estimation and learning in graph-structured state-space models with (partially) unknown dynamics and limited historical data. Building on recent methods that combine ideas from deep learning with principled inference in Gaussian Markov random fields (GMRF), we reformulate graph-structured state-space models as Deep GMRFs defined by simple spatial and temporal graph layers. This results in a flexible spatiotemporal prior that can be learned efficiently from a single time sequence via variational inference. Under linear Gaussian assumptions, we retain a closed-form posterior, which can be sampled efficiently using the conjugate gradient method, scaling favourably compared to classical Kalman filter based approaches
