Design Space Exploration on Efficient and Accurate Human Pose Estimation from Sparse IMU-Sensing
Iris Fürst-Walter, Antonio Nappi, Tanja Harbaum, Jürgen Becker
TL;DR
This work addresses the problem of privacy-preserving, energy-efficient human pose estimation by exploring the design space of sparse IMU sensing. It introduces a simulative Design Space Exploration (DSE) that synthesizes IMU data from a body-model dataset, trains a deep learning estimator for thousands of sensor configurations, and evaluates them with a unified accuracy-resource metric. A key contribution is the combined metric $M_i(\lambda) = e_i(1-\lambda) + \lambda i$, which balances pose accuracy (e.g., mesh error) against hardware costs (sensor count). The results identify a four-sensor configuration (pelvis, sternum, and elbows) that achieves a mesh error of 6.03 cm and reduces sensor count by two compared to the state of the art, demonstrating a strong accuracy-resource trade-off for practical health applications. The method lays groundwork for privacy-aware, resource-conscious design of fabric-integrated HPE systems and can guide deployment across rehabilitation and sports domains.
Abstract
Human Pose Estimation (HPE) for assessing human motion in sports, rehabilitation, or work safety requires accurate sensing without compromising the sensitive underlying personal data. Local processing is therefore necessary, and the limited energy budget of such systems can be addressed by using Inertial Measurement Units (IMUs) instead of the more common camera-based sensing. The central trade-off between accuracy and efficient use of hardware resources is rarely discussed in research. We address this trade-off with a simulative Design Space Exploration (DSE) that varies the number and positioning of IMU sensors. First, we generate IMU data from a publicly available body model dataset for different sensor configurations and train a deep learning model on this data. Additionally, we propose a combined metric to assess the accuracy-resource trade-off. We use the DSE as a tool to evaluate sensor configurations and identify beneficial ones for a specific use case. As an example, for a system in which accuracy and resources are equally important, we identify an optimal configuration of 4 sensors with a mesh error of 6.03 cm, increasing accuracy by 32.7% and reducing the hardware effort by two sensors compared to the state of the art. Our work can be used to design health applications with well-suited sensor positioning and attention to data privacy and resource awareness.
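
To make the accuracy-resource trade-off concrete, the following minimal Python sketch evaluates the combined metric from the TL;DR, $M_i(\lambda) = e_i(1-\lambda) + \lambda i$, and picks the configuration with the lowest score. It rests on assumptions not confirmed by the text above: that $i$ denotes the sensor count, that $e_i$ is the error of the configuration, and that no further normalization is applied. The function names and the configuration values are hypothetical placeholders, not results from the paper.

```python
from typing import Dict, Tuple


def combined_metric(error: float, n_sensors: int, lam: float) -> float:
    """Weighted sum e * (1 - lam) + lam * n_sensors.

    Assumption: `n_sensors` plays the role of `i` in the TL;DR formula and
    `error` the role of `e_i`; any normalization used in the paper is omitted.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return error * (1.0 - lam) + lam * n_sensors


def best_configuration(configs: Dict[str, Tuple[float, int]], lam: float) -> str:
    """Return the name of the configuration with the lowest combined metric."""
    return min(configs, key=lambda name: combined_metric(*configs[name], lam))


# Hypothetical placeholder values (error, sensor count); NOT taken from the paper.
configs = {
    "config_a": (6.0, 4),
    "config_b": (5.0, 6),
    "config_c": (9.0, 2),
}

# lam = 0 weights accuracy only, lam = 1 weights sensor count only,
# lam = 0.5 weights both equally.
for lam in (0.0, 0.5, 1.0):
    print(lam, best_configuration(configs, lam))
```

With these placeholder values, the selected configuration shifts from the most accurate one at $\lambda = 0$ to the one with the fewest sensors at $\lambda = 1$, which is the behavior the metric is meant to expose.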
