Sparse learned kernels for interpretable and efficient medical time series processing
Sully F. Chen, Zhicheng Guo, Cheng Ding, Xiao Hu, Cynthia Rudin
TL;DR
This work introduces Sparse Mixture of Learned Kernels (SMoLK), a lightweight, single-layer, sparse neural architecture for medical time-series processing that delivers competitive performance with orders of magnitude fewer parameters. By learning a bank of convolutional kernels and employing weight absorption and correlated kernel pruning, SMoLK achieves efficient, real-time segmentation of PPG artifacts and robust single-lead ECG atrial fibrillation detection while offering inherently interpretable kernel-level contributions. The approach demonstrates strong generalization to out-of-distribution data and maintains performance under quantitative pruning and quantization, making it suitable for low-power wearables. The results suggest that, for certain medical signal tasks, simple, interpretable models can rival deep networks without sacrificing accuracy, enabling deployment on resource-constrained devices and facilitating transparent clinical decision support.
Abstract
Rapid, reliable, and accurate interpretation of medical time-series signals is crucial for high-stakes clinical decision-making. Deep learning methods offered unprecedented performance in medical signal processing but at a cost: they were compute-intensive and lacked interpretability. We propose Sparse Mixture of Learned Kernels (SMoLK), an interpretable architecture for medical time series processing. SMoLK learns a set of lightweight flexible kernels that form a single-layer sparse neural network, providing not only interpretability, but also efficiency, robustness, and generalization to unseen data distributions. We introduce a parameter reduction techniques to reduce the size of SMoLK's networks while maintaining performance. We test SMoLK on two important tasks common to many consumer wearables: photoplethysmography (PPG) artifact detection and atrial fibrillation detection from single-lead electrocardiograms (ECGs). We find that SMoLK matches the performance of models orders of magnitude larger. It is particularly suited for real-time applications using low-power devices, and its interpretability benefits high-stakes situations.
