Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information

Edoardo Bianchi; Oswald Lanz

Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information

Edoardo Bianchi, Oswald Lanz

TL;DR

The paper addresses action recognition in sports by incorporating skeletal pose information into RGB-based models. It introduces Gate-Shift-Pose (GSP), which extends Gate-Shift-Fuse with two fusion strategies: early-fusion using pose heatmaps as an input channel and late-fusion using a dual-stream attention-based fusion. On the FR-FS ice-skating dataset, GSP substantially improves accuracy over RGB-only baselines, achieving up to $98.08\%$ with a ResNet50 backbone in early-fusion, and $95.19\%$ with a ResNet18 backbone in late-fusion, while also showing significant gains over the baseline by roughly $20$–$40\%$. The work demonstrates the value of multimodal architectures that integrate skeleton information for capturing complex motion patterns in sports, with practical implications for robust, real-time action recognition and analytics.

Abstract

This paper introduces Gate-Shift-Pose, an enhanced version of Gate-Shift-Fuse networks, designed for athlete fall classification in figure skating by integrating skeleton pose data alongside RGB frames. We evaluate two fusion strategies: early-fusion, which combines RGB frames with Gaussian heatmaps of pose keypoints at the input stage, and late-fusion, which employs a multi-stream architecture with attention mechanisms to combine RGB and pose features. Experiments on the FR-FS dataset demonstrate that Gate-Shift-Pose significantly outperforms the RGB-only baseline, improving accuracy by up to 40% with ResNet18 and 20% with ResNet50. Early-fusion achieves the highest accuracy (98.08%) with ResNet50, leveraging the model's capacity for effective multimodal integration, while late-fusion is better suited for lighter backbones like ResNet18. These results highlight the potential of multimodal architectures for sports action recognition and the critical role of skeleton pose information in capturing complex motion patterns. Visit the project page at https://edowhite.github.io/Gate-Shift-Pose

Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information

TL;DR

Abstract

Gate-Shift-Pose: Enhancing Action Recognition in Sports with Skeleton Information

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)