Intimate Partner Violence and Injury Prediction From Radiology Reports
Irene Y. Chen, Emily Alsentzer, Hyesun Park, Richard Thomas, Babina Gosangi, Rahul Gujrathi, Bharti Khurana
TL;DR
Intimate partner violence (IPV) is a critical, under-detected public health issue; this study leverages radiology reports to predict IPV and associated injuries using ML and NLP methods. The authors compare multiple models, including logistic regression, random forests, gradient boosting, Bag-of-Words neural nets, and ClinicalBERT-based representations, on 34,642 radiology reports from 1,479 patients, achieving a mean AUC of $0.852$ for IPV (sensitivity $0.64$, specificity $0.95$) and $0.887$ for injury. They further demonstrate that IPV can be predicted a median of $3.08$ years before program entry and discuss error analyses across demographic subgroups, as well as deployment challenges and future directions such as incorporating clinical history to improve accuracy and generalizability. Overall, the work shows the feasibility of automated IPV risk stratification from radiology text and highlights important considerations for real-world integration.
Abstract
Intimate partner violence (IPV) is an urgent, prevalent, and under-detected public health issue. We present machine learning models to assess patients for IPV and injury. We train the predictive algorithms on radiology reports with 1) IPV labels based on entry to a violence prevention program and 2) injury labels provided by emergency radiology fellowship-trained physicians. Our dataset includes 34,642 radiology reports and 1479 patients of IPV victims and control patients. Our best model predicts IPV a median of 3.08 years before violence prevention program entry with a sensitivity of 64% and a specificity of 95%. We conduct error analysis to determine for which patients our model has especially high or low performance and discuss next steps for a deployed clinical risk model.
