Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network
Hang Yin, Yan-Ming Zhang, Jian Xu, Jian-Long Chang, Yin Li, Cheng-Lin Liu
TL;DR
MDSTNet addresses the problem of air quality prediction by explicitly decoupling air pollutants and meteorological data into distinct modalities and leveraging multi-step weather forecasts as dynamic prompts. It introduces a three-branch encoder with Distillation Attention Modules to model spatial, pollutant-variate, and meteorological correlations, followed by a decoder that conditions predictions on forecast information. The authors present ChinaAirNet, a nationwide dataset that combines air quality records with multi-pressure-level meteorology, enabling large-scale evaluation and reproducibility. Experimental results show a 17.54% reduction in 48-hour prediction error over the prior state-of-the-art on ChinaAirNet, with ablations confirming the value of multi-level meteorology, cross-modal attention, and forecast-guided prompts for accurate, horizon-aware predictions.
Abstract
Air quality prediction plays a crucial role in public health and environmental protection. Accurate air quality prediction is a complex multivariate spatiotemporal problem, that involves interactions across temporal patterns, pollutant correlations, spatial station dependencies, and particularly meteorological influences that govern pollutant dispersion and chemical transformations. Existing works underestimate the critical role of atmospheric conditions in air quality prediction and neglect comprehensive meteorological data utilization, thereby impairing the modeling of dynamic interdependencies between air quality and meteorological data. To overcome this, we propose MDSTNet, an encoder-decoder framework that explicitly models air quality observations and atmospheric conditions as distinct modalities, integrating multi-pressure-level meteorological data and weather forecasts to capture atmosphere-pollution dependencies for prediction. Meantime, we construct ChinaAirNet, the first nationwide dataset combining air quality records with multi-pressure-level meteorological observations. Experimental results on ChinaAirNet demonstrate MDSTNet's superiority, substantially reducing 48-hour prediction errors by 17.54\% compared to the state-of-the-art model. The source code and dataset will be available on github.
