Knowledge Distillation for Reservoir-based Classifier: Human Activity Recognition
Masaharu Kagiyama, Tsuyoshi Okita
TL;DR
This work tackles the energy cost of HAR on edge devices by introducing PatchEchoClassifier, a reservoir-based time-series classifier that uses a patch tokenizer and an ESN reservoir. It trains the lightweight ESN student via knowledge distillation from a high-capacity 1DMLP-Mixer teacher through a Mixer-Echo State Signal Distillation framework with class and distillation tokens. The authors demonstrate that PatchEchoClassifier achieves above 80% accuracy while substantially reducing FLOPS, memory footprint, and energy metrics compared with CNN and transformer-based baselines, highlighting its suitability for real-time edge deployment. The study also discusses limitations, such as large Python library footprints, and outlines future directions including reservoir enhancements and quantization to further improve energy efficiency.
Abstract
This paper aims to develop an energy-efficient classifier for time-series data by introducing PatchEchoClassifier, a novel model that leverages a reservoir-based mechanism known as the Echo State Network (ESN). The model is designed for human activity recognition (HAR) using one-dimensional sensor signals and incorporates a tokenizer to extract patch-level representations. To train the model efficiently, we propose a knowledge distillation framework that transfers knowledge from a high-capacity MLP-Mixer teacher to the lightweight reservoir-based student model. Experimental evaluations on multiple HAR datasets demonstrate that our model achieves over 80 percent accuracy while significantly reducing computational cost. Notably, PatchEchoClassifier requires only about one-sixth of the floating point operations (FLOPS) compared to DeepConvLSTM, a widely used convolutional baseline. These results suggest that PatchEchoClassifier is a promising solution for real-time and energy-efficient human activity recognition in edge computing environments.
