Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare
Tianqi Shang, Weiqing He, Tianlong Chen, Ying Ding, Huanmei Wu, Kaixiong Zhou, Li Shen
TL;DR
This paper tackles integrating Social Determinants of Health (SDoH) into biomedical knowledge graphs and assesses prediction bias in a drug–disease link task. It constructs an SDoH-enriched KG from MIMIC-III, MIMIC-SBDH, and PrimeKG, and introduces a fairness formulation for graph embeddings that enforces invariance to sensitive SDoH attributes. A heterogeneous-GCN is trained for link prediction, biases with respect to different SDoH are detected, and a post-processing edge-reweighting scheme is proposed to mitigate SDoH-related bias while preserving predictive performance. The results demonstrate substantial bias reduction across multiple SDoH categories with only negligible changes in Mean Reciprocal Rank, underscoring the practicality of balancing fairness and accuracy in healthcare recommendations.
Abstract
Social determinants of health (SDoH) play a crucial role in patient health outcomes, yet their integration into biomedical knowledge graphs remains underexplored. This study addresses this gap by constructing an SDoH-enriched knowledge graph using the MIMIC-III dataset and PrimeKG. We introduce a novel fairness formulation for graph embeddings, focusing on invariance with respect to sensitive SDoH information. Via employing a heterogeneous-GCN model for drug-disease link prediction, we detect biases related to various SDoH factors. To mitigate these biases, we propose a post-processing method that strategically reweights edges connected to SDoHs, balancing their influence on graph representations. This approach represents one of the first comprehensive investigations into fairness issues within biomedical knowledge graphs incorporating SDoH. Our work not only highlights the importance of considering SDoH in medical informatics but also provides a concrete method for reducing SDoH-related biases in link prediction tasks, paving the way for more equitable healthcare recommendations. Our code is available at \url{https://github.com/hwq0726/SDoH-KG}.
