Rehabilitation Exercise Quality Assessment through Supervised Contrastive Learning with Hard and Soft Negatives
Mark Karlov, Ali Abedi, Shehroz S. Khan
TL;DR
This work tackles cross-exercise-type rehabilitation exercise quality assessment when per-type data are scarce by proposing a single Spatial-Temporal Graph Convolutional Network (ST-GCN) trained with supervised contrastive learning using hard and soft negatives to leverage all available data. A reference representation per exercise type enables inference through cosine similarity, yielding robust, generalizable quality assessments across UI-PRMD, IRDS, and KIMORE. The approach achieves state-of-the-art accuracy and AUC while significantly reducing model count compared to per-type models, and it demonstrates effective transfer learning to KIMORE. These results advance practical home-based virtual rehabilitation by enabling a unified, scalable assessment framework with strong cross-dataset performance and potential for further interpretability and multitask extensions.
Abstract
Exercise-based rehabilitation programs have proven to be effective in enhancing the quality of life and reducing mortality and rehospitalization rates. AI-driven virtual rehabilitation, which allows patients to independently complete exercises at home, utilizes AI algorithms to analyze exercise data, providing feedback to patients and updating clinicians on their progress. These programs commonly prescribe a variety of exercise types, leading to a distinct challenge in rehabilitation exercise assessment datasets: while abundant in overall training samples, these datasets often have a limited number of samples for each individual exercise type. This disparity hampers the ability of existing approaches to train generalizable models with such a small sample size per exercise type. Addressing this issue, this paper introduces a novel supervised contrastive learning framework with hard and soft negative samples that effectively utilizes the entire dataset to train a single model applicable to all exercise types. This model, with a Spatial-Temporal Graph Convolutional Network (ST-GCN) architecture, demonstrated enhanced generalizability across exercises and a decrease in overall complexity. Through extensive experiments on three publicly available rehabilitation exercise assessment datasets, UI-PRMD, IRDS, and KIMORE, our method has proven to surpass existing methods, setting a new benchmark in rehabilitation exercise quality assessment.
