Learning with Category-Equivariant Representations for Human Activity Recognition

Yoshihiro Maruyama

Learning with Category-Equivariant Representations for Human Activity Recognition

Yoshihiro Maruyama

TL;DR

This work addresses robustness in human activity recognition by modeling sensor data variability with a Category-Equivariant framework that combines a time-gain group $M=C_T\times\Lambda$ and a sensor-hierarchy poset $P$ into a product category. Data and model are formalized as functors, and category-equivariant representations are natural transformations that commute with both group actions and hierarchical morphisms. A concrete HAR realization uses RMS gain normalization, axis-to-magnitude pooling, and low-frequency real FFT magnitudes, yielding a compact, interpretable feature map that is invariant to time shifts and gain while respecting sensor hierarchies. Experiments on UCI HAR show substantial out-of-distribution robustness (≈3.6× baseline) under time/gain/pose perturbations, with theoretical guarantees: naturality on generators suffices for full equivariance and invariant blocks provide exact robustness under group perturbations. The approach offers data efficiency, robustness, and interpretability, and suggests broader applicability to multimodal sensing and hierarchical, non-invertible symmetry settings.

Abstract

Human activity recognition is challenging because sensor signals shift with context, motion, and environment; effective models must therefore remain stable as the world around them changes. We introduce a categorical symmetry-aware learning framework that captures how signals vary over time, scale, and sensor hierarchy. We build these factors into the structure of feature representations, yielding models that automatically preserve the relationships between sensors and remain stable under realistic distortions such as time shifts, amplitude drift, and device orientation changes. On the UCI Human Activity Recognition benchmark, this categorical symmetry-driven design improves out-of-distribution accuracy by approx. 46 percentage points (approx. 3.6x over the baseline), demonstrating that abstract symmetry principles can translate into concrete performance gains in everyday sensing tasks via category-equivariant representation theory.

Learning with Category-Equivariant Representations for Human Activity Recognition

TL;DR

This work addresses robustness in human activity recognition by modeling sensor data variability with a Category-Equivariant framework that combines a time-gain group

and a sensor-hierarchy poset

into a product category. Data and model are formalized as functors, and category-equivariant representations are natural transformations that commute with both group actions and hierarchical morphisms. A concrete HAR realization uses RMS gain normalization, axis-to-magnitude pooling, and low-frequency real FFT magnitudes, yielding a compact, interpretable feature map that is invariant to time shifts and gain while respecting sensor hierarchies. Experiments on UCI HAR show substantial out-of-distribution robustness (≈3.6× baseline) under time/gain/pose perturbations, with theoretical guarantees: naturality on generators suffices for full equivariance and invariant blocks provide exact robustness under group perturbations. The approach offers data efficiency, robustness, and interpretability, and suggests broader applicability to multimodal sensing and hierarchical, non-invertible symmetry settings.

Learning with Category-Equivariant Representations for Human Activity Recognition

TL;DR

Abstract

Learning with Category-Equivariant Representations for Human Activity Recognition

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (20)