Learning Disease Progression Models That Capture Health Disparities
Erica Chiang, Divya Shanmugam, Ashley N. Beecy, Gabriel Sayer, Deborah Estrin, Nikhil Garg, Emma Pierson
TL;DR
This work tackles bias in disease progression models caused by health disparities across initial severity, progression rate, and visit frequency. It introduces an interpretable Bayesian progression model with group-specific parameters and a Poisson visit process, proving identifiability of all parameters. Theoretical results show that ignoring disparities biases severity estimates, and synthetic experiments validate parameter recovery and bias findings. Application to NewYork-Presbyterian heart failure data reveals higher severity and distinct disparity patterns in non-white groups, with disparities accounting materially shift high-risk classifications. Overall, the study provides a disparity-aware framework for more accurate and equitable disease progression inference that can generalize to other chronic diseases.
Abstract
Disease progression models are widely used to inform the diagnosis and treatment of many progressive diseases. However, a significant limitation of existing models is that they do not account for health disparities that can bias the observed data. To address this, we develop an interpretable Bayesian disease progression model that captures three key health disparities: certain patient populations may (1) start receiving care only when their disease is more severe, (2) experience faster disease progression even while receiving care, or (3) receive follow-up care less frequently conditional on disease severity. We show theoretically and empirically that failing to account for any of these disparities can result in biased estimates of severity (e.g., underestimating severity for disadvantaged groups). On a dataset of heart failure patients, we show that our model can identify groups that face each type of health disparity, and that accounting for these disparities while inferring disease severity meaningfully shifts which patients are considered high-risk.
