A Perspective on Individualized Treatment Effects Estimation from Time-series Health Data
Ghadeer O. Ghosheh, Moritz Gögl, Tingting Zhu
TL;DR
This paper surveys the estimation of individualized treatment effects (ITE) from time-series health data, contrasting randomized controlled trials with observational EHR data and outlining the challenges posed by time-varying confounding. It categorizes time-series ITE methods into outcome-estimation approaches (Marginal Structural Models, G-formula, and balanced representations) and deconfounding strategies (latent factor models and noisy proxies), detailing their assumptions and architectures. The review emphasizes the prevalence of simulated and semi-synthetic datasets for validation and notes the scarcity of real-world counterfactual benchmarks, while outlining future directions such as irregular sampling, missing data handling, and connections to dynamic reinforcement learning. Overall, the work serves as a comprehensive resource for researchers at the intersection of causal inference, time-series modeling, and personalized medicine, guiding methodological development and practical deployment in real-world health settings.
Abstract
The burden of diseases is rising worldwide, with unequal treatment efficacy for patient populations that are underrepresented in clinical trials. Healthcare, however, is driven by the average population effect of medical treatments and, therefore, operates in a "one-size-fits-all" approach, not necessarily what best fits each patient. These facts suggest a pressing need for methodologies to study individualized treatment effects (ITE) to drive personalized treatment. Despite the increased interest in machine-learning-driven ITE estimation models, the vast majority focus on tabular data with limited review and understanding of methodologies proposed for time-series electronic health records (EHRs). To this end, this work provides an overview of ITE works for time-series data and insights into future research. The work summarizes the latest work in the literature and reviews it in light of theoretical assumptions, types of treatment settings, and computational frameworks. Furthermore, this work discusses challenges and future research directions for ITEs in a time-series setting. We hope this work opens new directions and serves as a resource for understanding one of the exciting yet under-studied research areas.
