LubDubDecoder: Bringing Micro-Mechanical Cardiac Monitoring to Hearables
Siqi Zhang, Xiyuxing Zhang, Duc Vu, Tao Qiang, Clara Palacios, Jiangyifei Zhu, Yuntao Wang, Mayank Goel, Justin Chan
TL;DR
This paper introduces LubDubDecoder, a hearable-based system that converts the in-ear speaker into an acoustic sensor to reconstruct fine micro-mechanical cardiac signals (SCG and GCG) from ear-based lub-dub sounds. It achieves cross-device generalization via a zero-effort normalization and a lightweight calibration step, and demonstrates robust performance across remounting, music playback, and real-world conditions in a feasibility study with 25 participants. The core contributions include a cross-modal autoencoder for ear sounds to SCG/GCG reconstruction, an automated fiducial-point labeling method, motion artifact removal, and cross-user and cross-device generalization strategies, yielding high waveform correlation (0.88–0.95) and low timing error (median near 0–0.5% of the cardiac cycle). The work advances ubiquitous, unobtrusive cardiac monitoring by enabling micro-cardiac event timing from everyday hearables, which can support early screening and continuous health insights in real-world settings.
Abstract
We present LubDubDecoder, a system that enables fine-grained monitoring of micro-cardiac vibrations associated with the opening and closing of heart valves across a range of hearables. Our system transforms the built-in speaker, the only transducer common to all hearables, into an acoustic sensor that captures the coarse "lub-dub" heart sounds, leverages their shared temporal and spectral structure to reconstruct the subtle seismocardiography (SCG) and gyrocardiography (GCG) waveforms, and extract the timing of key micro-cardiac events. In an IRB-approved feasibility study with 25 users, our system achieves correlations of 0.88-0.95 compared to chest-mounted reference measurements in within-user and cross-user evaluations, and generalizes to unseen hearables using a zero-effort adaptation scheme with a correlation of 0.91. Our system is robust across remounting sessions and music playback.
