Interpretable Air Pollution Forecasting by Physics-Guided Spatiotemporal Decoupling
Zhiguo Zhang, Xiaoliang Ma, Daniel Schlesinger
TL;DR
This work tackles the challenge of accurate and interpretable urban air quality forecasting by introducing a physics-guided spatiotemporal framework that decouples transport and local dynamics into two additive modules. A wind-conditioned, dynamic advection kernel models cross-station transport, while a station-wise interpretable attention encoder captures local temporal dependencies and attributions. The approach provides spatiotemporal interpretability through dynamic upwind weights and attention-based local attributions, and demonstrates superior predictive performance on a Stockholm PM$_{10}$ dataset relative to state-of-the-art baselines. Practically, the method offers reliable forecasts with transparent explanations suitable for operational air-quality management, and the experimental results include a case study that corroborates meteorology-consistent attributions. Overall, the combination of a learnable physics prior with an interpretable neural architecture delivers both accuracy and diagnostic insight for city-scale air pollution forecasting.
Abstract
Accurate and interpretable air pollution forecasting is crucial for public health, but most models face a trade-off between performance and interpretability. This study proposes a physics-guided, interpretable-by-design spatiotemporal learning framework. The model decomposes the spatiotemporal behavior of air pollutant concentrations into two transparent, additive modules. The first is a physics-guided transport kernel with directed weights conditioned on wind and geography (advection). The second is an explainable attention mechanism that learns local responses and attributes future concentrations to specific historical lags and exogenous drivers. Evaluated on a comprehensive dataset from the Stockholm region, our model consistently outperforms state-of-the-art baselines across multiple forecasting horizons. Our model's integration of high predictive performance and spatiotemporal interpretability provides a more reliable foundation for operational air-quality management in real-world applications.
