AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction
Kethmi Hirushini Hettige, Jiahao Ji, Shili Xiang, Cheng Long, Gao Cong, Jingyuan Wang
TL;DR
The paper tackles accurate, interpretable PM2.5 forecasting in urban air quality, especially under data sparsity. It proposes AirPhyNet, a physics-guided neural network that embeds diffusion and advection through a Diffusion-Advection differential equation network within a graph-based, end-to-end learning framework. On Beijing and Shenzhen datasets, AirPhyNet outperforms state-of-the-art baselines across 24h–72h forecasts and in sudden-change scenarios, with improvements up to around 10% in predictive error. Ablation and case-study analyses demonstrate the necessity of combining diffusion and advection and show the model yields physically meaningful predictions aligned with wind fields.
Abstract
Air quality prediction and modelling plays a pivotal role in public health and environment management, for individuals and authorities to make informed decisions. Although traditional data-driven models have shown promise in this domain, their long-term prediction accuracy can be limited, especially in scenarios with sparse or incomplete data and they often rely on black-box deep learning structures that lack solid physical foundation leading to reduced transparency and interpretability in predictions. To address these limitations, this paper presents a novel approach named Physics guided Neural Network for Air Quality Prediction (AirPhyNet). Specifically, we leverage two well-established physics principles of air particle movement (diffusion and advection) by representing them as differential equation networks. Then, we utilize a graph structure to integrate physics knowledge into a neural network architecture and exploit latent representations to capture spatio-temporal relationships within the air quality data. Experiments on two real-world benchmark datasets demonstrate that AirPhyNet outperforms state-of-the-art models for different testing scenarios including different lead time (24h, 48h, 72h), sparse data and sudden change prediction, achieving reduction in prediction errors up to 10%. Moreover, a case study further validates that our model captures underlying physical processes of particle movement and generates accurate predictions with real physical meaning.
