Advancements in Repetitive Action Counting: Joint-Based PoseRAC Model With Improved Performance

Haodong Chen; Ming C. Leu; Md Moniruzzaman; Zhaozheng Yin; Solmaz Hajmohammadi

TL;DR

This work tackles repetitive action counting (RepCount) by addressing limitations of RGB-frame and landmark-only approaches, notably sensitivity to camera viewpoint and miscounts. It extends PoseRAC by fusing five joint-angle features with pose landmarks, achieving an MAE of $0.211$ and an OBO accuracy of $0.599$ on the RepCount dataset, which improves on the prior state of the art in MAE. The method leverages pose saliency concepts and density-map visualization, using a Swin Transformer-based density map and an action-trigger mechanism to identify salient pose sequences across video frames. The approach is more robust to camera angle, discriminates sub-actions better, and recognizes salient poses more reliably, which offers practical benefits for fitness tracking and rehabilitation.
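To make the idea of fusing joint angles with landmarks concrete, here is a minimal sketch, not the authors' implementation. It assumes a joint angle is the angle at a middle landmark formed by two neighboring landmarks, computed from the dot product, and that fusion is plain concatenation. The names `joint_angle`, `fuse_features`, and `angle_triplets` are hypothetical, and the landmark triplets behind the paper's five angles are not given in this summary.

```python
import math

def joint_angle(a, b, c):
    """Angle in degrees at vertex b formed by the segments b->a and b->c.

    a, b, c are coordinate tuples of equal length (2D or 3D).
    """
    ba = [p - q for p, q in zip(a, b)]
    bc = [p - q for p, q in zip(c, b)]
    dot = sum(x * y for x, y in zip(ba, bc))
    norm = math.sqrt(sum(x * x for x in ba)) * math.sqrt(sum(x * x for x in bc))
    if norm == 0.0:
        return 0.0  # degenerate case: two landmarks coincide
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_theta = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_theta))

def fuse_features(landmarks, angle_triplets):
    """Concatenate flattened landmark coordinates with joint angles.

    landmarks: list of coordinate tuples, one per body landmark.
    angle_triplets: list of (i, j, k) index triplets; the angle is taken at j.
    Both the triplet choice and the concatenation are assumptions.
    """
    flat = [v for point in landmarks for v in point]
    angles = [joint_angle(landmarks[i], landmarks[j], landmarks[k])
              for i, j, k in angle_triplets]
    return flat + angles

# Toy example with three 2D landmarks forming a right angle at the middle one.
toy_landmarks = [(1.0, 0.0), (0.0, 0.0), (0.0, 1.0)]
toy_features = fuse_features(toy_landmarks, [(0, 1, 2)])
```

An angle computed this way depends only on the relative directions of the limb segments, not on where the body sits in the frame, which is one plausible reason angle features could help under viewpoint changes.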

Abstract

Repetitive action counting (RepCount) is critical in applications such as fitness tracking and rehabilitation. Previous methods have relied on red-green-blue (RGB) frames and body pose landmarks to estimate the number of action repetitions, but they suffer from several issues: unstable handling of changes in camera viewpoint, over-counting, under-counting, difficulty in distinguishing between sub-actions, and inaccurate recognition of salient poses. In this paper, building on the work of [1], we integrate joint angles with body pose landmarks to address these challenges and achieve better results than state-of-the-art RepCount methods, with a Mean Absolute Error (MAE) of 0.211 and an Off-By-One (OBO) counting accuracy of 0.599 on the RepCount data set [2]. Comprehensive experimental results demonstrate the effectiveness and robustness of our method.
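The two metrics quoted above are not defined in this summary. The sketch below assumes the definitions commonly used in the repetition-counting literature: MAE as the absolute count error normalized by the ground-truth count and averaged over videos, and OBO as the fraction of videos whose predicted count is within one of the ground truth. The function names and the sample counts are illustrative only.

```python
def mean_absolute_error(pred_counts, true_counts):
    """Normalized MAE: mean of |pred - true| / true over all videos.

    Assumes every ground-truth count is positive.
    """
    if any(t <= 0 for t in true_counts):
        raise ValueError("ground-truth counts must be positive")
    errors = [abs(p - t) / t for p, t in zip(pred_counts, true_counts)]
    return sum(errors) / len(errors)

def off_by_one_accuracy(pred_counts, true_counts):
    """OBO: fraction of videos with |pred - true| <= 1."""
    hits = sum(1 for p, t in zip(pred_counts, true_counts) if abs(p - t) <= 1)
    return hits / len(true_counts)

# Illustrative counts for three videos (not taken from the paper).
pred = [10, 4, 7]
true = [10, 5, 10]
mae_value = mean_absolute_error(pred, true)   # (0 + 0.2 + 0.3) / 3
obo_value = off_by_one_accuracy(pred, true)   # 2 of 3 videos within one
```

Under these definitions a lower MAE and a higher OBO are both better.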

Paper Structure (11 sections, 2 equations, 9 figures, 2 tables)

Introduction
Related Works
Contribution
Pose and Joint Angle Annotation
Data Set Annotation Correction
RepCount Visualization - Density Map
Experiments and Results
Experiment Setup
Evaluation of Different Scenarios Using Joint Angles
Visualization Comparison of Models: Landmark-Only vs. Landmarks + Joint Angles
Conclusion

Figures (9)

Figure 1: Pose saliency annotation. Instead of annotating the start and end, the two most salient poses were annotated in PoseRAC (Yao et al., 2023).
Figure 2: BlazePose landmarks and five joint angles
Figure 3: Action-trigger mechanism using density map (Yao et al., 2023).
Figure 4: Density maps: landmarks-only vs. landmarks + joint angles - addressing instability under changes in camera viewpoint
Figure 5: Density maps: landmarks-only vs. landmarks + joint angles - addressing the over-counting issue
...and 4 more figures
