ECGtizer: a fully automated digitizing and signal recovery pipeline for electrocardiograms
Alex Lence, Ahmad Fall, Samuel David Cohen, Federica Granese, Jean-Daniel Zucker, Joe-Elie Salem, Edi Prifti
TL;DR
ECGtizer tackles the challenge of turning historical paper ECGs into machine-readable data and recovering missing signal portions for AI-ready analysis. It combines automated lead detection, three pixel-based trace extraction methods, and a UNet-based signal-reconstruction module to yield complete 12-lead, 10-second ECG records. Across real-world (JOCOVID) and public (PTB-XL) datasets, ECGtizer demonstrates superior signal fidelity and feature preservation compared with state-of-the-art tools, and it enables competitive downstream tasks such as TdP-risk classification after retraining on digitized data. The approach significantly broadens access to historical ECG cohorts and supports AI-driven diagnostics, while remaining open source and adaptable to additional ECG formats in future work.
Abstract
Electrocardiograms (ECGs) are essential for diagnosing cardiac pathologies, yet traditional paper-based ECG storage poses significant challenges for automated analysis. This study introduces ECGtizer, an open-source, fully automated tool designed to digitize paper ECGs and recover signals lost during storage. ECGtizer facilitates automated analyses using modern AI methods. It employs automated lead detection, three pixel-based signal extraction algorithms, and a deep learning-based signal reconstruction module. We evaluated ECGtizer on two datasets: a real-life cohort from the COVID-19 pandemic (JOCOVID) and a publicly available dataset (PTB-XL). Performance was compared with two existing methods: the fully automated ECGminer and the semi-automated PaperECG, which requires human intervention. ECGtizer's performance was assessed in terms of signal recovery and the fidelity of clinically relevant feature measurement. Additionally, we tested these tools on a third dataset (GENEREPOL) for downstream AI tasks. Results show that ECGtizer outperforms existing tools, with its ECGtizerFrag algorithm delivering superior signal recovery. While PaperECG demonstrated better outcomes than ECGminer, it required human input. ECGtizer enhances the usability of historical ECG data and supports advanced AI-based diagnostic methods, making it a valuable addition to the field of AI in ECG analysis.
