Bayes-CATSI: A variational Bayesian deep learning framework for medical time series data imputation
Omkar Kulkarni, Rohitash Chandra
TL;DR
Bayes-CATSI addresses the need for uncertainty-aware imputation in medical time-series data by integrating variational Bayesian layers into the CATSI framework, enabling probabilistic imputation and uncertainty quantification. The approach replaces deterministic deep-learning components with Bayesian counterparts (or partial Bayesian variants) across context-aware recurrent imputation, cross-feature imputation, and the fusion layer, trained via Bayes-by-backprop with an ELBO objective. Empirical results on multi-modal ICU data show Bayes-CATSI achieving a $9.57\%$ improvement in RMSE over CATSI for individual missing values, along with reduced prediction uncertainty; partial Bayes-CATSI offers a trade-off between performance gains and computational cost, performing variably across missingness patterns. The work contributes open-source code and highlights how uncertainty quantification can enhance reliability and decision-making in clinical data imputation, while outlining future work on larger datasets, MCMC sampling, and Transformer-based enhancements.
Abstract
Medical time series datasets feature missing values that need data imputation methods, however, conventional machine learning models fall short due to a lack of uncertainty quantification in predictions. Among these models, the CATSI (Context-Aware Time Series Imputation) stands out for its effectiveness by incorporating a context vector into the imputation process, capturing the global dependencies of each patient. In this paper, we propose a Bayesian Context-Aware Time Series Imputation (Bayes-CATSI) framework which leverages uncertainty quantification offered by variational inference. We consider the time series derived from electroencephalography (EEG), electrooculography (EOG), electromyography (EMG), electrocardiology (EKG). Variational Inference assumes the shape of the posterior distribution and through minimization of the Kullback-Leibler(KL) divergence it finds variational densities that are closest to the true posterior distribution. Thus , we integrate the variational Bayesian deep learning layers into the CATSI model. Our results show that Bayes-CATSI not only provides uncertainty quantification but also achieves superior imputation performance compared to the CATSI model. Specifically, an instance of Bayes-CATSI outperforms CATSI by 9.57 %. We provide an open-source code implementation for applying Bayes-CATSI to other medical data imputation problems.
