Past, Present, and Future of Sensor-Based Human Activity Recognition Using Wearables: A Surveying Tutorial on a Still Challenging Task
Harish Haresamudram, Chi Ian Tang, Sungho Suh, Paul Lukowicz, Thomas Ploetz
TL;DR
This survey traces the evolution of sensor-based HAR from hand-crafted features and the Activity Recognition Chain to end-to-end deep learning, identifying core limitations such as data scarcity, information coding, and signal ambivalence. It highlights a current shift toward learning rich representations through self-supervised and contrastive learning, including multi-device and multi-modal approaches, and reviews data generation/augmentation strategies (video-to-IMU, GANs, diffusion, simulations). The paper then argues for a third paradigm: leveraging foundational models (LLMs and TS foundation models) to inject world knowledge, perform cross-modal alignment, and generate diverse synthetic data to improve generalization in real-world HAR. It provides a hands-on tutorial and accompanying code to guide practitioners in building practical HAR systems that scale beyond traditional benchmarks. Significantly, the work outlines actionable directions for integrating foundational models to address long-standing HAR challenges and broaden the applicability of wearables in health, sport, and industrial settings.
Abstract
In the many years since the inception of wearable sensor-based Human Activity Recognition (HAR), a wide variety of methods have been introduced and evaluated for their ability to recognize activities. Substantial gains have been made since the days of hand-crafting heuristics as features, yet, progress has seemingly stalled on many popular benchmarks, with performance falling short of what may be considered 'sufficient'-- despite the increase in computational power and scale of sensor data, as well as rising complexity in techniques being employed. The HAR community approaches a new paradigm shift, this time incorporating world knowledge from foundational models. In this paper, we take stock of sensor-based HAR -- surveying it from its beginnings to the current state of the field, and charting its future. This is accompanied by a hands-on tutorial, through which we guide practitioners in developing HAR systems for real-world application scenarios. We provide a compendium for novices and experts alike, of methods that aim at finally solving the activity recognition problem.
