A Temporally Disentangled Contrastive Diffusion Model for Spatiotemporal Imputation
Yakun Chen, Kaize Shi, Zhangkai Wu, Juan Chen, Xianzhi Wang, Julian McAuley, Guandong Xu, Shui Yu
TL;DR
This work tackles spatiotemporal imputation under non-stationarity by introducing C$^2$TSD, a conditional diffusion model that disentangles temporal structure into trend and seasonality and augments learning with contrastive representation. By conditioning the reverse diffusion on disentangled temporal features and spatial context from a Graph Neural Network, and by employing temporal and spatial attention in the noise predictor, the method achieves accurate and probabilistic imputations while mitigating error accumulation typical of recurrent approaches. Comprehensive experiments on AQI-36, PEMS-BAY, and METR-LA demonstrate consistent improvements over strong baselines, with ablations confirming the critical roles of contrastive learning and temporal disentanglement. The proposed approach advances practical spatiotemporal imputation by delivering robust generalization across unseen distributions and missing patterns, at the cost of higher computational demand inherent to diffusion models.
Abstract
Spatiotemporal data analysis is pivotal across various domains, such as transportation, meteorology, and healthcare. The data collected in real-world scenarios are often incomplete due to device malfunctions and network errors. Spatiotemporal imputation aims to predict missing values by exploiting the spatial and temporal dependencies in the observed data. Traditional imputation approaches based on statistical and machine learning techniques require the data to conform to their distributional assumptions, while graph and recurrent neural networks are prone to error accumulation problems due to their recurrent structures. Generative models, especially diffusion models, can potentially circumvent the reliance on inaccurate, previously imputed values for future predictions; However, diffusion models still face challenges in generating stable results. We propose to address these challenges by designing conditional information to guide the generative process and expedite the training process. We introduce a conditional diffusion framework called C$^2$TSD, which incorporates disentangled temporal (trend and seasonality) representations as conditional information and employs contrastive learning to improve generalizability. Our extensive experiments on three real-world datasets demonstrate the superior performance of our approach compared to a number of state-of-the-art baselines.
