CuPID: Leveraging Masked Single-Lead ECG Modelling for Enhancing the Representations
Adtian Atienza, Gouthamaan Manimaran, Jakob E. Bardram, Sadasivan Puthusserypady
TL;DR
CuPID addresses the challenge of learning robust representations from single-lead ECG by mitigating the lack of cross-lead context in masked data modelling. It does so by cueing the decoder with spectrogram-derived context as the Key in a transformer attention mechanism during pre-training, forcing the encoder to produce more informative patch representations that transfer to downstream tasks. Across MIT-AFIB, LT-AF, and Physionet 2017, CuPID outperforms strong single-lead SSL baselines in linear probing and shows competitive gains under fine-tuning, validating the practical value of spectrogram-guided context in ECG representation learning. The findings highlight the potential of context-aware pre-training to enhance remote cardiovascular monitoring while identifying limitations tied to dataset diversity and pre-training data availability.
Abstract
Wearable sensing devices, such as Electrocardiogram (ECG) heart-rate monitors, will play a crucial role in the future of digital health. This continuous monitoring leads to massive unlabeled data, incentivizing the development of unsupervised learning frameworks. While Masked Data Modelling (MDM) techniques have enjoyed wide use, their direct application to single-lead ECG data is suboptimal due to the decoder's difficulty handling irregular heartbeat intervals when no contextual information is provided. In this paper, we present Cueing the Predictor Increments the Detailing (CuPID), a novel MDM method tailored to single-lead ECGs. CuPID enhances existing MDM techniques by cueing spectrogram-derived context to the decoder, thus incentivizing the encoder to produce more detailed representations. This has a significant impact on the encoder's performance across a wide range of different configurations, leading CuPID to outperform state-of-the-art methods in a variety of downstream tasks.
