SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

Hu Cui; Wenqiang Hua; Renjing Huang; Shurui Jia; Tessai Hayama

SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

Hu Cui, Wenqiang Hua, Renjing Huang, Shurui Jia, Tessai Hayama

TL;DR

SasMamba tackles monocular 3D human pose estimation by preserving skeletal topology through Skeleton Structure-Aware Stride SSM (SAS-SSM). It combines a structure-aware spatiotemporal convolution with a stride-based scan to build multi-scale global representations while maintaining linear computational complexity, and integrates this into a lightweight SasMamba model. The approach achieves competitive or state-of-the-art results on Human3.6M and MPI-INF-3DHP with far fewer parameters and computations than Transformer-based or hybrid architectures, demonstrating strong efficiency and scalability. This structure-aware, multi-scale SSM framework offers practical benefits for real-time or resource-constrained 3D pose estimation while preserving spatial integrity and long-range dependencies.

Abstract

Recently, the Mamba architecture based on State Space Models (SSMs) has gained attention in 3D human pose estimation due to its linear complexity and strong global modeling capability. However, existing SSM-based methods typically apply manually designed scan operations to flatten detected 2D pose sequences into purely temporal sequences, either locally or globally. This approach disrupts the inherent spatial structure of human poses and entangles spatial and temporal features, making it difficult to capture complex pose dependencies. To address these limitations, we propose the Skeleton Structure-Aware Stride SSM (SAS-SSM), which first employs a structure-aware spatiotemporal convolution to dynamically capture essential local interactions between joints, and then applies a stride-based scan strategy to construct multi-scale global structural representations. This enables flexible modeling of both local and global pose information while maintaining linear computational complexity. Built upon SAS-SSM, our model SasMamba achieves competitive 3D pose estimation performance with significantly fewer parameters compared to existing hybrid models. The source code is available at https://hucui2022.github.io/sasmamba_proj/.

SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

TL;DR

Abstract

SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)