Unlocking the diagnostic potential of electrocardiograms through information transfer from cardiac magnetic resonance imaging
Özgün Turgut, Philip Müller, Paul Hager, Suprosanna Shit, Sophie Starck, Martin J. Menten, Eimo Martens, Daniel Rueckert
TL;DR
The paper addresses the limited diagnostic ability of ECG for comprehensive cardiovascular assessment and the cost of CMR imaging. It introduces a self-supervised MMCL framework that first performs masked data modelling on ECG with a Transformer backbone and then applies multimodal contrastive learning with paired ECG-CMR data to transfer morphology information to ECG; the framework uses a shared latent space of dimension $128$, a masking ratio $\rho=0.8$, and a CLIP-style temperature $\tau$. On 40,044 UK Biobank subjects, the method improves subject-specific disease risk prediction for CAD, AF, and DM and enhances prediction of 61 CMR-derived cardiac phenotypes from ECG compared with unimodal baselines. It also provides a patch-based interpretability module and is open-sourced, signaling broad practical impact for affordable, holistic cardiac screening.
Abstract
Cardiovascular diseases (CVD) can be diagnosed using various diagnostic modalities. The electrocardiogram (ECG) is a cost-effective and widely available diagnostic aid that provides functional information of the heart. However, its ability to classify and spatially localise CVD is limited. In contrast, cardiac magnetic resonance (CMR) imaging provides detailed structural information of the heart and thus enables evidence-based diagnosis of CVD, but long scan times and high costs limit its use in clinical routine. In this work, we present a deep learning strategy for cost-effective and comprehensive cardiac screening solely from ECG. Our approach combines multimodal contrastive learning with masked data modelling to transfer domain-specific information from CMR imaging to ECG representations. In extensive experiments using data from 40,044 UK Biobank subjects, we demonstrate the utility and generalisability of our method for subject-specific risk prediction of CVD and the prediction of cardiac phenotypes using only ECG data. Specifically, our novel multimodal pre-training paradigm improves performance by up to 12.19 % for risk prediction and 27.59 % for phenotype prediction. In a qualitative analysis, we demonstrate that our learned ECG representations incorporate information from CMR image regions of interest. Our entire pipeline is publicly available at https://github.com/oetu/MMCL-ECG-CMR.
