Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine
Yishen Liu, Shengda Luo, Zishao Zhong, Tongtong Wu, Jianguo Zhang, Peiyao Ou, Yong Liang, Liang Liu, Hudan Pan
TL;DR
This work tackles the biases and data gaps of English-centric LLMs in Chinese medical and TCM contexts by introducing Hengqin-RA-v1, the first RA-focused LLM tailored to Traditional Chinese Medicine, and HQ-GCM-RA-C1, a comprehensive RA-centric dataset derived from ancient texts, modern literature, and clinical materials. The authors propose a progressive, data-centric training pipeline that combines structured medical-record reasoning with retrieval-enhanced generation and instance-oriented context, anchored by the CMeKG knowledge graph and domain-specific instruction data. Experimental results show Hengqin-RA-v1 achieving superior performance on TCM-RA tasks, including a 54% passing rate on TCM exams and qualitative improvements in diagnostic and treatment guidance, though some clinical-detail gaps remain. The dataset and model collectively aim to reduce bias, improve cultural and clinical fidelity in Chinese RA care, and pave the way for subsequent generations (v2, v3) and broader arthritis-focused TCMed AI systems.
Abstract
Large language models (LLMs) primarily trained on English texts, often face biases and inaccuracies in Chinese contexts. Their limitations are pronounced in fields like Traditional Chinese Medicine (TCM), where cultural and clinical subtleties are vital, further hindered by a lack of domain-specific data, such as rheumatoid arthritis (RA). To address these issues, this paper introduces Hengqin-RA-v1, the first large language model specifically tailored for TCM with a focus on diagnosing and treating RA. We also present HQ-GCM-RA-C1, a comprehensive RA-specific dataset curated from ancient Chinese medical literature, classical texts, and modern clinical studies. This dataset empowers Hengqin-RA-v1 to deliver accurate and culturally informed responses, effectively bridging the gaps left by general-purpose models. Extensive experiments demonstrate that Hengqin-RA-v1 outperforms state-of-the-art models, even surpassing the diagnostic accuracy of TCM practitioners in certain cases.
