Large-scale Long-tailed Disease Diagnosis on Radiology Images
Qiaoyu Zheng, Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Lisong Dai, Hengyu Guan, Yuehua Li, Ya Zhang, Yanfeng Wang, Weidi Xie
TL;DR
RadDiag introduces aTransformer-based, multimodal radiology foundation model capable of ingesting arbitrary numbers of 2D and 3D scans across modalities for case-level, multi-label disease diagnosis. The RP3D-DiagDS dataset, drawn from Radiopaedia, provides over 40k cases spanning 9 modalities and 7 anatomies with 5,568 disorders mapped to 930 ICD-10-CM codes, enabling long-tailed learning. A knowledge-enhanced training pipeline supervises a vision encoder with a medical-text backbone to improve discrimination among rare diseases, and a fusion module aggregates multi-scan information at case level. Empirical results show strong internal performance ($\approx$ $95\%$ AUC), substantial zero-shot and finetuning transfer to external benchmarks, and robust generalization across modalities and anatomies, underscoring the value of publicly available medical data for building generalist AI in healthcare.
Abstract
Developing a generalist radiology diagnosis system can greatly enhance clinical diagnostics. In this paper, we introduce RadDiag, a foundational model supporting 2D and 3D inputs across various modalities and anatomies, using a transformer-based fusion module for comprehensive disease diagnosis. Due to patient privacy concerns and the lack of large-scale radiology diagnosis datasets, we utilize high-quality, clinician-reviewed radiological images available online with diagnosis labels. Our dataset, RP3D-DiagDS, contains 40,936 cases with 195,010 scans covering 5,568 disorders (930 unique ICD-10-CM codes). Experimentally, our RadDiag achieves 95.14% AUC on internal evaluation with the knowledge-enhancement strategy. Additionally, RadDiag can be zero-shot applied or fine-tuned to external diagnosis datasets sourced from various hospitals, demonstrating state-of-the-art results. In conclusion, we show that publicly shared medical data on the Internet is a tremendous and valuable resource that can potentially support building a generalist AI for healthcare.
