Improving performance of heart rate time series classification by grouping subjects
Michael Beekhuizen, Arman Naseri, David Tax, Ivo van der Bilt, Marcel Reinders
TL;DR
Heart rate time series are noisier and provide less detail for activity classification than ECG/PPG, with substantial inter-subject variability. The study uses the BigIdeasLab_STEP dataset to evaluate window/stride effects, clustering of subjects, and the impact of handcrafted features on deep learning classifiers, along with misclassification analysis. Key findings show that larger window sizes and smaller strides improve accuracy, clustering subjects into similar groups reduces inter-subject variability, and incorporating handcrafted features with DL yields further gains, while misclassifications cluster around activity transitions. The results suggest a path toward semi-personalized HR-based activity recognition, though generalization is limited by dataset size and device heterogeneity, and future work should explore additional normalization strategies and richer subject metadata.
Abstract
Unlike the more commonly analyzed ECG or PPG data for activity classification, heart rate time series data is less detailed, often noisier and can contain missing data points. Using the BigIdeasLab_STEP dataset, which includes heart rate time series annotated with specific tasks performed by individuals, we sought to determine if general classification was achievable. Our analyses showed that the accuracy is sensitive to the choice of window/stride size. Moreover, we found variable classification performances between subjects due to differences in the physical structure of their hearts. Various techniques were used to minimize this variability. First of all, normalization proved to be a crucial step and significantly improved the performance. Secondly, grouping subjects and performing classification inside a group helped to improve performance and decrease inter-subject variability. Finally, we show that including handcrafted features as input to a deep learning (DL) network improves the classification performance further. Together, these findings indicate that heart rate time series can be utilized for classification tasks like predicting activity. However, normalization or grouping techniques need to be chosen carefully to minimize the issue of subject variability.
