PIM: Physics-Informed Multi-task Pre-training for Improving Inertial Sensor-Based Human Activity Recognition

Dominique Nshimyimana; Vitor Fortes Rey; Sungho Suh; Bo Zhou; Paul Lukowicz

TL;DR

This work tackles the labeled-data bottleneck in inertial-sensor human activity recognition by introducing Physics-Informed Multi-task Pre-training (PIM), a self-supervised framework that integrates fundamental physical constraints of human motion into pretext tasks. By deriving three physics-based pseudo-labels, Speed of Motion, Angular features, and Symmetry (the SAM-tasks), and training a shared encoder with dedicated heads, PIM learns representations that generalize well with limited labeled data. Experiments across four HAR benchmarks show that PIM outperforms state-of-the-art SSL baselines in few-shot settings, with gains of almost 10% in macro-F1 and accuracy when only 2 to 8 labeled examples per class are available. The method highlights the importance of embedding physical principles into SSL for wearables and points to future work on single-device deployment, cross-dataset transfer, and imbalanced-data handling.

Abstract

Human activity recognition (HAR) with deep learning models relies on large amounts of labeled data, which are often challenging to obtain due to the associated cost, time, and labor. Self-supervised learning (SSL) has emerged as an effective approach to leverage unlabeled data through pretext tasks, such as masked reconstruction and multi-task learning with signal processing-based data augmentations, to pre-train encoder models. However, such methods are often derived from computer vision approaches that disregard the physical mechanisms and constraints that govern wearable sensor data and the phenomena they reflect. In this paper, we propose a physics-informed multi-task pre-training (PIM) framework for IMU-based HAR. PIM generates pretext tasks based on an understanding of basic physical aspects of human motion, including movement speed, angles of movement, and symmetry between sensor placements. Given a sensor signal, we calculate the corresponding features using physics-based equations and use them as pretext tasks for SSL. This enables the model to capture fundamental physical characteristics of human activities, which is especially relevant for multi-sensor systems. Experimental evaluations on four HAR benchmark datasets demonstrate that the proposed method outperforms existing state-of-the-art methods, including data augmentation and masked reconstruction, in terms of accuracy and F1 score. We observed gains of almost 10% in macro F1 score and accuracy with only 2 to 8 labeled examples per class, and up to 3% when there is no reduction in the amount of training data.
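To make the idea of physics-derived pretext targets concrete, the following is a minimal sketch of how three such pseudo-labels might be computed from raw accelerometer windows. The abstract does not state the equations PIM uses, so every formula here is an assumption chosen for illustration: speed as the mean magnitude of integrated, mean-removed acceleration; angle as the tilt of the mean acceleration vector relative to the sensor z-axis; symmetry as the Pearson correlation between the acceleration magnitudes of two sensor placements. The function names `speed_label`, `angle_label`, and `symmetry_label` are hypothetical.

```python
import numpy as np

def speed_label(acc, fs):
    """Assumed speed target: mean magnitude of integrated acceleration.

    acc: array of shape (T, 3) in m/s^2; fs: sampling rate in Hz.
    """
    # Crude gravity/bias removal: subtract the per-axis window mean.
    linear = acc - acc.mean(axis=0, keepdims=True)
    # Rectangle-rule integration: v[t] ~ sum of a[0..t] * dt, with dt = 1/fs.
    vel = np.cumsum(linear, axis=0) / fs
    return float(np.linalg.norm(vel, axis=1).mean())

def angle_label(acc):
    """Assumed angle target: tilt (radians) of mean acceleration vs. z-axis."""
    g = acc.mean(axis=0)
    cos = g[2] / (np.linalg.norm(g) + 1e-12)  # epsilon guards a zero vector
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))

def symmetry_label(acc_a, acc_b):
    """Assumed symmetry target: correlation of magnitudes at two placements."""
    mag_a = np.linalg.norm(acc_a, axis=1)
    mag_b = np.linalg.norm(acc_b, axis=1)
    # Note: returns NaN if either magnitude series is constant.
    return float(np.corrcoef(mag_a, mag_b)[0, 1])

# Example: a 2-second window at 50 Hz from two hypothetical wrist sensors.
rng = np.random.default_rng(0)
left = rng.normal(size=(100, 3)) + np.array([0.0, 0.0, 9.81])
right = left + 0.1 * rng.normal(size=(100, 3))
targets = (speed_label(left, 50), angle_label(left), symmetry_label(left, right))
```

In a multi-task setup of the kind the abstract describes, values like these would serve as regression targets for separate heads on a shared encoder, so no manual activity labels are needed during pre-training.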
