DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition

Marius Bock; Michael Moeller; Kristof Van Laerhoven

DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition

Marius Bock, Michael Moeller, Kristof Van Laerhoven

TL;DR

HAR learning traditionally relies on sliding windows, which constrain temporal context. DeepConvContext introduces a multi-scale framework that combines intra-window CNN+LSTM feature extraction with an inter-window LSTM to capture both local and long-range temporal dependencies, with bidirectional inter-patch modeling further boosting performance. Across six HAR benchmarks, it achieves about a 10% average F1-score gain over DeepConvLSTM (up to 21%), and ablations show LSTMs consistently outperform attention- and Transformer-based alternatives for inertial data. This work advances practical HAR by enabling more coherent inter-window reasoning while maintaining competitive computational profiles, and code is publicly available for reproducibility.

Abstract

Despite recognized limitations in modeling long-range temporal dependencies, Human Activity Recognition (HAR) has traditionally relied on a sliding window approach to segment labeled datasets. Deep learning models like the DeepConvLSTM typically classify each window independently, thereby restricting learnable temporal context to within-window information. To address this constraint, we propose DeepConvContext, a multi-scale time series classification framework for HAR. Drawing inspiration from the vision-based Temporal Action Localization community, DeepConvContext models both intra- and inter-window temporal patterns by processing sequences of time-ordered windows. Unlike recent HAR models that incorporate attention mechanisms, DeepConvContext relies solely on LSTMs -- with ablation studies demonstrating the superior performance of LSTMs over attention-based variants for modeling inertial sensor data. Across six widely-used HAR benchmarks, DeepConvContext achieves an average 10% improvement in F1-score over the classic DeepConvLSTM, with gains of up to 21%. Code to reproduce our experiments is publicly available via github.com/mariusbock/context_har.

DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition

TL;DR

Abstract

DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)