Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales
Xinyu Yang, Yu Sun, Xinyang Chen, Ying Zhang, Xiaojie Yuan
TL;DR
This work tackles missing data in spatial-temporal sensors by introducing GSLI, a multi-scale graph structure learning framework. It jointly learns node-scale graphs per feature and a feature-scale graph across features, incorporating prominence modeling to weight influential nodes/features. Through cross-feature and cross-temporal representations, GSLI captures rich spatial-temporal dependencies and demonstrates consistent improvements over diverse real datasets across multiple missing-data scenarios. The approach yields robust imputations and shows favorable downstream forecasting performance, with comprehensive analysis of complexity, ablations, and resource use. Overall, GSLI offers a principled, adaptable mechanism to handle feature heterogeneity and varying spatial relationships in spatial-temporal imputation tasks.
Abstract
Spatial-temporal data collected across different geographic locations often suffer from missing values, posing challenges to data analysis. Existing methods primarily leverage fixed spatial graphs to impute missing values, which implicitly assume that the spatial relationship is roughly the same for all features across different locations. However, they may overlook the different spatial relationships of diverse features recorded by sensors in different locations. To address this, we introduce the multi-scale Graph Structure Learning framework for spatial-temporal Imputation (GSLI) that dynamically adapts to the heterogeneous spatial correlations. Our framework encompasses node-scale graph structure learning to cater to the distinct global spatial correlations of different features, and feature-scale graph structure learning to unveil common spatial correlation across features within all stations. Integrated with prominence modeling, our framework emphasizes nodes and features with greater significance in the imputation process. Furthermore, GSLI incorporates cross-feature and cross-temporal representation learning to capture spatial-temporal dependencies. Evaluated on six real incomplete spatial-temporal datasets, GSLI showcases the improvement in data imputation.
