INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis
Shih-Cheng Huang, Zepeng Huo, Ethan Steinberg, Chia-Chun Chiang, Matthew P. Lungren, Curtis P. Langlotz, Serena Yeung, Nigam H. Shah, Jason A. Fries
TL;DR
The paper introduces INSPECT, a large-scale multimodal dataset for pulmonary embolism that combines 3D CT pulmonary angiography, radiology report impressions, and longitudinal EHR data from $19{,}402$ patients ($23{,}248$ CTPA studies). It defines eight PE-related diagnostic and prognostic tasks and provides baselines across imaging, EHR, and multimodal fusion, with open-source code and trained weights to enable reproducible evaluation. NLP-based labeling of PE from radiology impressions and careful de-identification under a Data Use Agreement enable public sharing while preserving privacy. Experimental results show imaging-based methods excel in PE diagnosis, EHR methods in prognosis, and fusion improves diagnostic performance but not prognosis, highlighting both the promise and the challenge of multimodal approaches for PE. INSPECT thus lays a foundation for future research in multimodal medical AI, providing a rich resource for benchmarking and method development.
Abstract
Synthesizing information from multiple data sources plays a crucial role in the practice of modern medicine. Current applications of artificial intelligence in medicine often focus on single-modality data due to a lack of publicly available, multimodal medical datasets. To address this limitation, we introduce INSPECT, which contains de-identified longitudinal records from a large cohort of patients at risk for pulmonary embolism (PE), along with ground truth labels for multiple outcomes. INSPECT contains data from 19,402 patients, including CT images, radiology report impression sections, and structured electronic health record (EHR) data (i.e. demographics, diagnoses, procedures, vitals, and medications). Using INSPECT, we develop and release a benchmark for evaluating several baseline modeling approaches on a variety of important PE related tasks. We evaluate image-only, EHR-only, and multimodal fusion models. Trained models and the de-identified dataset are made available for non-commercial use under a data use agreement. To the best of our knowledge, INSPECT is the largest multimodal dataset integrating 3D medical imaging and EHR for reproducible methods evaluation and research.
