nanoML for Human Activity Recognition
Alan T. L. Bacellar, Mugdha P. Jadhao, Shashank Nag, Priscila M. V. Lima, Felipe M. G. Franca, Lizy K. John
TL;DR
This paper tackles the challenge of energy-efficient Human Activity Recognition (HAR) on resource-constrained wearables. It introduces Differentiable Weightless Neural Networks (DWNs), a LUT-based, hardware-friendly nanoML approach trained with Extended Finite Difference and Learnable Mapping, capable of end-to-end optimization. On the UCI-HAR dataset, DWNs achieve 96.34%–96.67% accuracy with per-sample energy as low as 56–104 nJ and end-to-end latency of 5 ns when deployed on an FPGA, while preserving a tiny model footprint. The results demonstrate ultra-low-power edge HAR with substantial energy and memory savings (up to 926,000× energy and 260× memory) compared to CNNs/Transformers, highlighting strong potential for ASIC-enabled wearables and practical edge AI deployment.
Abstract
Human Activity Recognition (HAR) is critical for applications in healthcare, fitness, and IoT, but deploying accurate models on resource-constrained devices remains challenging due to high energy and memory demands. This paper demonstrates the application of Differentiable Weightless Neural Networks (DWNs) to HAR, achieving competitive accuracies of 96.34% and 96.67% while consuming only 56nJ and 104nJ per sample, with an inference time of just 5ns per sample. The DWNs were implemented and evaluated on an FPGA, showcasing their practical feasibility for energy-efficient hardware deployment. DWNs achieve up to 926,000x energy savings and 260x memory reduction compared to state-of-the-art deep learning methods. These results position DWNs as a nano-machine learning nanoML model for HAR, setting a new benchmark in energy efficiency and compactness for edge and wearable devices, paving the way for ultra-efficient edge AI.
