Shadow and Light: Digitally Reconstructed Radiographs for Disease Classification
Benjamin Hou, Qingqing Zhu, Tejas Sudarshan Mathai, Qiao Jin, Zhiyong Lu, Ronald M. Summers
TL;DR
This work presents DRR-RATE, a large synthetic chest X-ray dataset generated from the CT-RATE corpus, pairing X-ray images (including lateral views) with radiology reports and 18 pathology labels. DRRs are produced with Siddon-Jacobs ray tracing at 512×512 resolution with a -100 HU threshold; the dataset contains 50,188 DRRs from 21,304 patients and is released under a CC BY-NC-SA license. CheXnet is evaluated on both CheXpert and DRR-RATE, showing strong performance for Cardiomegaly and Pleural Effusion on DRR-RATE, with domain-shift effects observed when models trained on real X-rays are tested on synthetic DRRs. The work demonstrates the viability of CT-derived, text-annotated DRR data for multimodal AI in radiology, and discusses limitations and societal considerations for AI deployment in healthcare.
Abstract
In this paper, we introduce DRR-RATE, a large-scale synthetic chest X-ray dataset derived from the recently released CT-RATE dataset. DRR-RATE comprises 50,188 frontal Digitally Reconstructed Radiographs (DRRs) from 21,304 unique patients. Each image is paired with a corresponding radiology text report and binary labels for 18 pathology classes. Because DRR generation is controllable, the dataset can also be extended with lateral views and images from any desired viewing position. This opens up avenues for research into novel multimodal applications involving paired CT, X-ray images from various views, text, and binary labels. We demonstrate the applicability of DRR-RATE alongside existing large-scale chest X-ray resources, notably the CheXpert dataset and the CheXnet model. Experiments show that CheXnet, when trained and tested on the DRR-RATE dataset, achieves adequate to high AUC scores for six pathologies commonly cited in the literature: Atelectasis, Cardiomegaly, Consolidation, Lung Lesion, Lung Opacity, and Pleural Effusion. Additionally, CheXnet trained on the CheXpert dataset can accurately identify several pathologies, even when operating out of distribution. This confirms that the generated DRR images effectively capture the essential pathology features from CT images. The dataset and labels are publicly accessible at https://huggingface.co/datasets/farrell236/DRR-RATE.
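
The per-pathology AUC scores mentioned above can be illustrated with a minimal sketch. It is not the authors' evaluation code: it assumes predictions and binary labels are available as plain Python lists with one column per pathology, and the function names `binary_auc` and `per_class_auc` are hypothetical. AUC is computed from the rank-sum (Mann-Whitney U) statistic, with tied scores given average ranks.

```python
from typing import Dict, List, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC for one pathology via the Mann-Whitney U statistic."""
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC is undefined when only one class is present")

    # Sort sample indices by score, then give tied scores their average rank.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    # U = (sum of positive ranks) - n_pos * (n_pos + 1) / 2
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    u_stat = rank_sum_pos - n_pos * (n_pos + 1) / 2.0
    return u_stat / (n_pos * n_neg)


def per_class_auc(
    y_true: Sequence[Sequence[int]],
    y_score: Sequence[Sequence[float]],
    class_names: Sequence[str],
) -> Dict[str, float]:
    """One AUC per label column (multi-label setting, one column per pathology)."""
    result: Dict[str, float] = {}
    for c, name in enumerate(class_names):
        labels: List[int] = [row[c] for row in y_true]
        scores: List[float] = [row[c] for row in y_score]
        result[name] = binary_auc(labels, scores)
    return result


if __name__ == "__main__":
    # Toy values for illustration only; these are not results from the paper.
    names = ["Cardiomegaly", "Pleural Effusion"]
    truth = [[0, 1], [0, 0], [1, 1], [1, 0]]
    preds = [[0.10, 0.90], [0.40, 0.20], [0.35, 0.70], [0.80, 0.30]]
    print(per_class_auc(truth, preds, names))
```

Each pathology is scored independently because the labels are binary and not mutually exclusive, so a single image can be positive for several classes at once.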
