Detecting abnormal heart sound using mobile phones and on-device IConNet
Linh Vu, Thu Tran
TL;DR
The paper tackles the need for accessible early screening of cardiovascular disease by detecting abnormal heart sounds directly from audio on mobile devices. It introduces IConNet, an on-device, end-to-end Interpretable CNN that eliminates segmentation and MFCC-based preprocessing. On the PhysioNet/CinC dataset, IConNet achieves about 92.05% F1 with a compact model (~154k parameters, ~493.3 kB), outperforming MFCC-based baselines and CRNN pipelines, though not yet surpassing state-of-the-art ResNet results. The work demonstrates feasible privacy-preserving, on-device screening with interpretable front-end features, supporting trustworthy AI in mobile health and remote monitoring.
Abstract
Given the global prevalence of cardiovascular diseases, there is a pressing need for easily accessible early screening methods. Typically, this requires medical practitioners to investigate heart auscultations for irregular sounds, followed by echocardiography and electrocardiography tests. To democratize early diagnosis, we present a user-friendly solution for abnormal heart sound detection, utilizing mobile phones and a lightweight neural network optimized for on-device inference. Unlike previous approaches reliant on specialized stethoscopes, our method directly analyzes audio recordings, facilitated by a novel architecture known as IConNet. IConNet, an Interpretable Convolutional Neural Network, harnesses insights from audio signal processing, enhancing efficiency and providing transparency in neural pattern extraction from raw waveform signals. This is a significant step towards trustworthy AI in healthcare, aiding in remote health monitoring efforts.
