VoxMed: One-Step Respiratory Disease Classifier using Digital Stethoscope Sounds
Paridhi Mundra, Manik Sharma, Yashwardhan Chaudhuri, Orchid Chetia Phukan, Arun Balaji Buduru
TL;DR
Respiratory diseases demand fast, reliable bedside diagnostics. VoxMed is a UI-assisted, one-step classifier that analyzes digital stethoscope recordings, using an Audio Spectrogram Transformer (AST) for feature extraction and a 1-D CNN for disease classification. Evaluated on the ICBHI dataset, AST-based embeddings achieve competitive accuracy and F1 scores, outperforming several alternative backbones across multiple class configurations. The system supports on-the-spot assessment through a simple upload-and-infer workflow and can be augmented with API-sourced patient information to aid clinical decisions. VoxMed thus provides a practical, rapid diagnostic aid with the potential to improve patient care and workflow in busy clinical environments.
Abstract
As respiratory illnesses become more common, detecting them quickly and accurately is crucial to improving patient care, and immediate medical assessment calls for better diagnostic methods. This paper introduces VoxMed, a UI-assisted one-step classifier that uses digital stethoscope recordings to diagnose respiratory diseases. It employs an Audio Spectrogram Transformer (AST) for feature extraction and a 1-D CNN-based architecture for classification, giving professionals information about their patients' respiratory health in seconds. We evaluate on the ICBHI dataset, which comprises stethoscope recordings collected from patients in Greece and Portugal. GitHub repository: https://github.com/Sample-User131001/VoxMed
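
The two-stage design described above (AST embeddings fed to a 1-D CNN classifier) can be illustrated with a minimal, dependency-free sketch of the classifier's forward pass. It is not the authors' implementation: the embedding size of 768, the number of filters, the kernel size, the three output classes, and the randomly initialised weights are all assumptions chosen for illustration, and a random vector stands in for a real AST embedding.

```python
import math
import random


def conv1d(signal, kernels, biases):
    """Valid 1-D cross-correlation of a single-channel signal with each kernel."""
    out = []
    for kernel, bias in zip(kernels, biases):
        k = len(kernel)
        out.append([
            sum(signal[i + j] * kernel[j] for j in range(k)) + bias
            for i in range(len(signal) - k + 1)
        ])
    return out


def relu(channels):
    """Element-wise ReLU over every feature map."""
    return [[max(0.0, v) for v in ch] for ch in channels]


def global_max_pool(channels):
    """Reduce each feature map to its maximum activation."""
    return [max(ch) for ch in channels]


def linear(features, weights, biases):
    """Fully connected layer: one weight row and one bias per output class."""
    return [sum(f * w for f, w in zip(features, row)) + b
            for row, b in zip(weights, biases)]


def softmax(logits):
    """Numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def random_params(num_filters, kernel_size, num_classes, seed=0):
    """Hypothetical, untrained weights; a real model would load trained ones."""
    rng = random.Random(seed)
    return {
        "kernels": [[rng.uniform(-0.1, 0.1) for _ in range(kernel_size)]
                    for _ in range(num_filters)],
        "conv_bias": [0.0] * num_filters,
        "fc_weights": [[rng.uniform(-0.1, 0.1) for _ in range(num_filters)]
                       for _ in range(num_classes)],
        "fc_bias": [0.0] * num_classes,
    }


def classify(embedding, params):
    """Conv -> ReLU -> global max pool -> linear -> softmax over disease classes."""
    maps = relu(conv1d(embedding, params["kernels"], params["conv_bias"]))
    pooled = global_max_pool(maps)
    return softmax(linear(pooled, params["fc_weights"], params["fc_bias"]))


# Assumed sizes: 768-dimensional embedding, 8 filters of width 5, 3 classes.
EMBED_DIM, NUM_FILTERS, KERNEL_SIZE, NUM_CLASSES = 768, 8, 5, 3

rng = random.Random(42)
fake_embedding = [rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]  # placeholder for AST output
params = random_params(NUM_FILTERS, KERNEL_SIZE, NUM_CLASSES)
probabilities = classify(fake_embedding, params)
print(probabilities)  # one probability per (hypothetical) disease class
```

In practice the placeholder vector would be replaced by the embedding an AST model produces for an uploaded recording, and the weights would come from training on labelled ICBHI data; a deep learning framework would normally handle both steps.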
