Transfer Learning with Clinical Concept Embeddings from Large Language Models
Yuhe Gao, Runxue Bao, Yuelyu Ji, Yiming Sun, Chenxi Song, Jeffrey P. Ferraro, Ye Ye
TL;DR
This study investigates how semantic clinical concept embeddings from LLMs affect local and cross-site transfer learning for influenza detection in EHRs across two hospitals. It compares One-Hot baselines, BERT-family embeddings (BioBERT, Med-BERT) and OpenAI embeddings under local and transfer settings using a 1-D CNN classifier, with careful attention to fine-tuning strategies. The findings indicate domain-specific embeddings like Med-BERT yield the best local performance and transfer portability, while generic embeddings require more tuning, and excessive fine-tuning can reduce effectiveness. The work offers practical guidance on selecting embeddings and tuning strategies to enable robust knowledge sharing across healthcare sites.
Abstract
Knowledge sharing is crucial in healthcare, especially when leveraging data from multiple clinical sites to address data scarcity, reduce costs, and enable timely interventions. Transfer learning can facilitate cross-site knowledge transfer, but a major challenge is heterogeneity in clinical concepts across different sites. Large Language Models (LLMs) show significant potential of capturing the semantic meaning of clinical concepts and reducing heterogeneity. This study analyzed electronic health records from two large healthcare systems to assess the impact of semantic embeddings from LLMs on local, shared, and transfer learning models. Results indicate that domain-specific LLMs, such as Med-BERT, consistently outperform in local and direct transfer scenarios, while generic models like OpenAI embeddings require fine-tuning for optimal performance. However, excessive tuning of models with biomedical embeddings may reduce effectiveness, emphasizing the need for balance. This study highlights the importance of domain-specific embeddings and careful model tuning for effective knowledge transfer in healthcare.
