Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management
Lai Wei, Zhen Ying, Muyang He, Yutong Chen, Qian Yang, Yanzhe Hong, Jiaping Lu, Kaipeng Zheng, Shaoting Zhang, Xiaoying Li, Weiran Huang, Ying Chen
TL;DR
Diabetes care faces global resource and knowledge gaps that limit scalable, personalized management. The authors present Diabetica, a diabetes-focused language framework trained via a self-distillation pipeline on a carefully curated Diabetes-QA dataset, paired with a reproducible data processing pipeline (collection, filtering, augmentation, refinement) and dedicated benchmarks (MCQ, fill-in-the-blank, dialogue). They demonstrate state-of-the-art performance on diabetes-specific tasks and validate its utility through online patient consulting, medical education, and clinical record summarization studies, outperforming several open-source peers and approaching or exceeding proprietary systems in key areas. This work delivers a practical path to deploying domain-specific language capabilities in diabetes care, with implications for patient personalization, clinician education, and workflow efficiency.
Abstract
Diabetes is a chronic disease with a significant global health burden, requiring multi-stakeholder collaboration for optimal management. Large language models (LLMs) have shown promise in various healthcare scenarios, but their effectiveness across diverse diabetes tasks remains unproven. Our study introduced a framework to train and validate diabetes-specific LLMs. We first developed a comprehensive data processing pipeline that includes data collection, filtering, augmentation and refinement. This created a high-quality, diabetes-specific dataset and evaluation benchmarks from scratch. Fine-tuned on the collected training dataset, our diabetes-specific LLM family demonstrated state-of-the-art proficiency in processing various diabetes tasks compared to other LLMs. Furthermore, clinical studies revealed the potential applications of our models in diabetes care, including providing personalized healthcare, assisting medical education, and streamlining clinical tasks. Generally, our introduced framework helps develop diabetes-specific LLMs and highlights their potential to enhance clinical practice and provide personalized, data-driven support for diabetes management across different end users. Our codes, benchmarks and models are available at https://github.com/waltonfuture/Diabetica.
