GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs
Pengcheng Jiang, Cao Xiao, Adam Cross, Jimeng Sun
TL;DR
GraphCare tackles the challenge of personalized healthcare predictions by generating patient-specific knowledge graphs from external knowledge sources, including LLM-derived triples and established biomedical KGs, and by processing them with a Bi-attention AugmenTed (BAT) GNN. The method first creates concept-level KGs, then composes a patient-level KG with temporal visit data, and finally applies BAT to produce robust predictions across mortality, readmission, LOS, and drug recommendation tasks. Empirical results on MIMIC-III and MIMIC-IV show clear gains over strong baselines, with particular strength in data-scarce settings and clear interpretability via KG-based importance scores. The work highlights the practical potential of integrating open-world knowledge into clinical prediction pipelines while addressing ethical considerations and privacy concerns.
Abstract
Clinical predictive models often rely on patients' electronic health records (EHR), but integrating medical knowledge to enhance predictions and decision-making is challenging. This is because personalized predictions require personalized knowledge graphs (KGs), which are difficult to generate from patient EHR data. To address this, we propose \textsc{GraphCare}, an open-world framework that uses external KGs to improve EHR-based predictions. Our method extracts knowledge from large language models (LLMs) and external biomedical KGs to build patient-specific KGs, which are then used to train our proposed Bi-attention AugmenTed (BAT) graph neural network (GNN) for healthcare predictions. On two public datasets, MIMIC-III and MIMIC-IV, \textsc{GraphCare} surpasses baselines in four vital healthcare prediction tasks: mortality, readmission, length of stay (LOS), and drug recommendation. On MIMIC-III, it boosts AUROC by 17.6\% and 6.6\% for mortality and readmission, and F1-score by 7.9\% and 10.8\% for LOS and drug recommendation, respectively. Notably, \textsc{GraphCare} demonstrates a substantial edge in scenarios with limited data availability. Our findings highlight the potential of using external KGs in healthcare prediction tasks and demonstrate the promise of \textsc{GraphCare} in generating personalized KGs for promoting personalized medicine.
