Beyond Traditional Diagnostics: Transforming Patient-Side Information into Predictive Insights with Knowledge Graphs and Prototypes
Yibowen Zhao, Yinan Zhang, Zhixiang Su, Lizhen Cui, Chunyan Miao
TL;DR
This paper tackles disease prediction from patient-side information. To address data imbalance and limited interpretability, it introduces KPI, a framework that combines a knowledge-grounded disease graph, prototype-guided contrastive learning, and LLM-based explanations. KPI constructs a unified disease knowledge graph, derives patient-specific subgraphs, and aligns narrative embeddings with disease prototypes while enforcing cross-modal consistency. Experiments on the Haodf dataset show that KPI outperforms baselines in accuracy and provides clinically valid, patient-tailored explanations; it is particularly strong on long-tail diseases and efficient at inference. The work advances patient-centered triage by delivering interpretable, knowledge-grounded predictions that can support clinicians and patients in making early, informed decisions.
Abstract
Predicting diseases solely from patient-side information, such as demographics and self-reported symptoms, has attracted significant research attention due to its potential to enhance patient awareness, facilitate early healthcare engagement, and improve healthcare system efficiency. However, existing approaches face two critical challenges: imbalanced disease distributions and a lack of interpretability, which lead to biased or unreliable predictions. To address these issues, we propose the Knowledge graph-enhanced, Prototype-aware, and Interpretable (KPI) framework. KPI systematically integrates structured and trusted medical knowledge into a unified disease knowledge graph, constructs clinically meaningful disease prototypes, and employs contrastive learning to enhance predictive accuracy, particularly for long-tailed diseases. Additionally, KPI utilizes large language models (LLMs) to generate patient-specific, medically relevant explanations, thereby improving interpretability and reliability. Extensive experiments on real-world datasets demonstrate that KPI outperforms state-of-the-art methods in predictive accuracy and provides clinically valid explanations that closely align with patient narratives, highlighting its practical value for patient-centered healthcare delivery.
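
To make the idea of aligning patient narratives with disease prototypes more concrete, the following is a minimal sketch of a generic prototype-based contrastive objective. It is not the KPI implementation: the function names (`l2_normalize`, `prototype_contrastive_loss`, `predict`), the cosine-similarity scoring, and the `temperature` value are all assumptions made for illustration. The sketch assumes each disease has one prototype vector and each patient narrative has already been encoded into an embedding of the same dimension.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def prototype_contrastive_loss(embeddings, prototypes, labels, temperature=0.1):
    """Hypothetical prototype contrastive loss (illustrative, not the paper's).

    Each embedding is pulled toward the prototype of its own disease and
    pushed away from all other prototypes via a softmax over similarities.
    """
    z = l2_normalize(embeddings)          # (N, d) patient narrative embeddings
    p = l2_normalize(prototypes)          # (C, d) one prototype per disease
    logits = z @ p.T / temperature        # (N, C) scaled cosine similarities
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Mean negative log-probability of the true disease prototype.
    return float(-log_probs[np.arange(len(labels)), labels].mean())


def predict(embeddings, prototypes):
    """Assign each patient to the disease with the most similar prototype."""
    sims = l2_normalize(embeddings) @ l2_normalize(prototypes).T
    return np.argmax(sims, axis=1)


if __name__ == "__main__":
    # Toy data: three diseases with orthogonal prototypes (assumed values).
    prototypes = np.eye(3)
    embeddings = np.array([[0.9, 0.1, 0.0],
                           [0.0, 1.0, 0.1],
                           [0.1, 0.0, 0.8]])
    labels = np.array([0, 1, 2])
    print("loss:", prototype_contrastive_loss(embeddings, prototypes, labels))
    print("predictions:", predict(embeddings, prototypes))
```

Because every disease contributes exactly one prototype to the softmax regardless of how many training cases it has, an objective of this kind gives rare diseases the same representation in the denominator as common ones, which is one plausible reason prototype-based training can help on long-tailed distributions.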
