MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data
Jianping Zhou, Bin Lu, Zhanyu Liu, Siyu Pan, Xuejun Feng, Hua Wei, Guanjie Zheng, Xinbing Wang, Chenghu Zhou
TL;DR
This work tackles the challenge of imputing missing traffic data without resorting to pre-filling, which can inject noise and cause over-smoothing. It introduces MagiNet, a mask-aware graph imputation network built from an adaptive mask spatio-temporal encoder (AMSTenc) and a mask-aware spatio-temporal decoder (MASTdec) that leverage mask-aware attention and Chebyshev graph convolution to capture intrinsic spatio-temporal dependencies. Across five real-world datasets, MagiNet consistently outperforms both complete-data baselines with prefill and specialized incomplete-data imputation methods, achieving average improvements of 4.31% in RMSE and 3.72% in MAPE, with robustness across varying missing ratios. The approach advances practical ITS data completion by avoiding noise-prone pre-filling and better preserving spatial and temporal structure during imputation, with potential extensions to probabilistic imputation and large-scale deployment.
Abstract
Due to detector malfunctions and communication failures, missing data is ubiquitous during the collection of traffic data. Therefore, it is of vital importance to impute the missing values to facilitate data analysis and decision-making for Intelligent Transportation System (ITS). However, existing imputation methods generally perform zero pre-filling techniques to initialize missing values, introducing inevitable noises. Moreover, we observe prevalent over-smoothing interpolations, falling short in revealing the intrinsic spatio-temporal correlations of incomplete traffic data. To this end, we propose Mask-Aware Graph imputation Network: MagiNet. Our method designs an adaptive mask spatio-temporal encoder to learn the latent representations of incomplete data, eliminating the reliance on pre-filling missing values. Furthermore, we devise a spatio-temporal decoder that stacks multiple blocks to capture the inherent spatial and temporal dependencies within incomplete traffic data, alleviating over-smoothing imputation. Extensive experiments demonstrate that our method outperforms state-of-the-art imputation methods on five real-world traffic datasets, yielding an average improvement of 4.31% in RMSE and 3.72% in MAPE.
