Prospects for AI-Enhanced ECG as a Unified Screening Tool for Cardiac and Non-Cardiac Conditions -- An Explorative Study in Emergency Care
Nils Strodthoff, Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp
TL;DR
This exploratory study assesses whether a single 12-lead ECG can serve as a unified screening signal for predicting a broad spectrum of cardiac and non-cardiac discharge diagnoses. Using the public MIMIC-IV-ECG-ICD-ED data, the authors train multi-label classifiers (comparing XResNet1d50 and S4) to predict 1076 ICD-10 codes, achieving AUROCs above 0.8 for hundreds of codes, including 81 cardiac and 172 non-cardiac categories, with strong results in the ED setting. External validation on CODE-test corroborates performance across diverse cardiac conditions, and the S4 architecture consistently outperforms the convolutional baseline across training/evaluation scenarios; the best ED-focused model achieves a macro AUROC of about $0.774$ on ED-to-ED predictions. The work argues for the potential of AI-enhanced ECG as a scalable screening tool in emergency care, while acknowledging limitations inherent to label proxies, confounding factors, and distribution shifts, and outlining directions toward explainability and richer multi-modal inputs. Overall, the study provides evidence that ECG data, when paired with large-scale, well-curated public datasets and rigorous evaluation, can support broad, fine-grained screening across diverse clinical conditions, with meaningful implications for ED triage and patient profiling.
Abstract
Current deep learning algorithms designed for automatic ECG analysis have exhibited notable accuracy. However, akin to traditional electrocardiography, they tend to be narrowly focused and typically address a singular diagnostic condition. In this exploratory study, we specifically investigate the capability of a single model to predict a diverse range of both cardiac and non-cardiac discharge diagnoses based on a sole ECG collected in the emergency department. We find that 253, 81 cardiac, and 172 non-cardiac, ICD codes can be reliably predicted in the sense of exceeding an AUROC score of 0.8 in a statistically significant manner. This underscores the model's proficiency in handling a wide array of cardiac and non-cardiac diagnostic scenarios which demonstrates potential as a screening tool for diverse medical encounters.
