Mitigating the Risk of Health Inequity Exacerbated by Large Language Models
Yuelyu Ji, Wenhe Ma, Sonish Sivarajkumar, Hang Zhang, Eugene Mathew Sadhu, Zhuochun Li, Xizhi Wu, Shyam Visweswaran, Yanshan Wang
TL;DR
The paper addresses health inequity risks arising from Large Language Models in clinical trial matching and medical question answering by revealing how sociodemographic inputs can bias outputs. It introduces EquityGuard, a contrastive-learning framework that disentangles social determinants of health from task embeddings to reduce inequities and improve fairness without sacrificing performance. Across CTM and MQA datasets and several LLMs, EquityGuard yields more uniform task outcomes and lowers fairness gaps as measured by EO and DP, with notable improvements in error rates for underrepresented groups. This framework advances equitable AI in clinical settings and highlights practical considerations for deploying LLMs in healthcare while mitigating disparities.
Abstract
Recent advancements in large language models have demonstrated their potential in numerous medical applications, particularly in automating clinical trial matching for translational research and enhancing medical question answering for clinical decision support. However, our study shows that incorporating non decisive sociodemographic factors such as race, sex, income level, LGBT+ status, homelessness, illiteracy, disability, and unemployment into the input of LLMs can lead to incorrect and harmful outputs for these populations. These discrepancies risk exacerbating existing health disparities if LLMs are widely adopted in healthcare. To address this issue, we introduce EquityGuard, a novel framework designed to detect and mitigate the risk of health inequities in LLM based medical applications. Our evaluation demonstrates its efficacy in promoting equitable outcomes across diverse populations.
