MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

Thai-Khanh Nguyen; Uyen Vo; Tan M. Nguyen; Thieu N. Vo; Trung-Hieu Le; Cuong Pham

MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

Thai-Khanh Nguyen, Uyen Vo, Tan M. Nguyen, Thieu N. Vo, Trung-Hieu Le, Cuong Pham

TL;DR

This work targets human activity recognition from inertial sensors, addressing gradient instability and limited long-range modeling in conventional deep models. It introduces Momentum Mamba, a momentum-augmented selective state-space model that injects second-order dynamics via a velocity state, preserving linear-time computation while improving gradient flow and temporal expressiveness. Extensions include Complex Momentum Mamba for frequency-selective memory and Adam Momentum Mamba for variance-aware adaptivity, all designed to robustly model long-horizon inertial sequences. Empirical results on MuWiGes, UESTC-MMEA-CL, and MMAct show consistent accuracy gains over Vanilla Mamba and Transformer baselines, with the Complex Variant providing the strongest performance at the cost of higher resources. The approach offers a scalable, edge-friendly paradigm for HAR and points to broader applicability in sequence modeling tasks requiring stable long-range memory and content-aware processing.

Abstract

Human activity recognition (HAR) from inertial sensors is essential for ubiquitous computing, mobile health, and ambient intelligence. Conventional deep models such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and transformers have advanced HAR but remain limited by vanishing or exloding gradients, high computational cost, and difficulty in capturing long-range dependencies. Structured state-space models (SSMs) like Mamba address these challenges with linear complexity and effective temporal modeling, yet they are restricted to first-order dynamics without stable longterm memory mechanisms. We introduce Momentum Mamba, a momentum-augmented SSM that incorporates second-order dynamics to improve stability of information flow across time steps, robustness, and long-sequence modeling. Two extensions further expand its capacity: Complex Momentum Mamba for frequency-selective memory scaling. Experiments on multiple HAR benchmarks demonstrate consistent gains over vanilla Mamba and Transformer baselines in accuracy, robustness, and convergence speed. With only moderate increases in training cost, momentum-augmented SSMs offer a favorable accuracy-efficiency balance, establishing them as a scalable paradigm for HAR and a promising principal framework for broader sequence modeling applications.

MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

TL;DR

Abstract

MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (13)