Uncertainty-Aware Deep Attention Recurrent Neural Network for Heterogeneous Time Series Imputation
Linglong Qian, Zina Ibrahim, Richard Dobson
TL;DR
DEARI addresses non-random missingness in heterogeneous multivariate time series by unifying deep attention-based imputation with self-supervised metric learning and Bayesian marginalization. By extending the BRITS backbone with self-attention, residuals, and a deep metric objective, it enables scalable deep architectures that improve imputation accuracy while providing explicit uncertainty estimates. Empirical results across air quality, healthcare, and traffic datasets show deterministic and Bayesian DEARI surpass state-of-the-art baselines, with uncertainty quantification aiding reliability. This approach offers practical benefits for time-series analysis in domains with complex missingness, facilitating both accurate imputations and trustworthy confidence bounds for downstream decisions.
Abstract
Missingness is ubiquitous in multivariate time series and poses an obstacle to reliable downstream analysis. Although recurrent network imputation achieved the SOTA, existing models do not scale to deep architectures that can potentially alleviate issues arising in complex data. Moreover, imputation carries the risk of biased estimations of the ground truth. Yet, confidence in the imputed values is always unmeasured or computed post hoc from model output. We propose DEep Attention Recurrent Imputation (DEARI), which jointly estimates missing values and their associated uncertainty in heterogeneous multivariate time series. By jointly representing feature-wise correlations and temporal dynamics, we adopt a self attention mechanism, along with an effective residual component, to achieve a deep recurrent neural network with good imputation performance and stable convergence. We also leverage self-supervised metric learning to boost performance by optimizing sample similarity. Finally, we transform DEARI into a Bayesian neural network through a novel Bayesian marginalization strategy to produce stochastic DEARI, which outperforms its deterministic equivalent. Experiments show that DEARI surpasses the SOTA in diverse imputation tasks using real-world datasets, namely air quality control, healthcare and traffic.
