Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

Jisoo Lee; Tamim Ahmed; Thanassis Rikakis; Pavan Turaga

Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

Jisoo Lee, Tamim Ahmed, Thanassis Rikakis, Pavan Turaga

TL;DR

This paper tackles objective post-stroke rehabilitation assessment in aging populations, where real-world data are scarce. It proposes a three-phase framework that combines 2D hand-object keypoint detection with 1D temporal segmentation using Transformer-based encoders, tailored for small datasets and multiple camera views. Utilizing the ASAR ARAT dataset, it demonstrates data refinement techniques to handle occlusion and missing data, and shows that deeper Transformer-based models—especially when combined with LSTM—improve frame-wise segmentation accuracy across views. The work highlights the clinical relevance of biomechanics-informed design and data-quality improvements, with potential to extend to other domains requiring precise action segmentation under small-data constraints.

Abstract

Rehabilitation is essential and critical for post-stroke patients, addressing both physical and cognitive aspects. Stroke predominantly affects older adults, with 75% of cases occurring in individuals aged 65 and older, underscoring the urgent need for tailored rehabilitation strategies in aging populations. Despite the critical role therapists play in evaluating rehabilitation progress and ensuring the effectiveness of treatment, current assessment methods can often be subjective, inconsistent, and time-consuming, leading to delays in adjusting therapy protocols. This study aims to address these challenges by providing a solution for consistent and timely analysis. Specifically, we perform temporal segmentation of video recordings to capture detailed activities during stroke patients' rehabilitation. The main application scenario motivating this study is the clinical assessment of daily tabletop object interactions, which are crucial for post-stroke physical rehabilitation. To achieve this, we present a framework that leverages the biomechanics of movement during therapy sessions. Our solution divides the process into two main tasks: 2D keypoint detection to track patients' physical movements, and 1D time-series temporal segmentation to analyze these movements over time. This dual approach enables automated labeling with only a limited set of real-world data, addressing the challenges of variability in patient movements and limited dataset availability. By tackling these issues, our method shows strong potential for practical deployment in physical therapy settings, enhancing the speed and accuracy of rehabilitation assessments.

Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

TL;DR

Abstract

Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)