Diffusion-Driven Inertial Generated Data for Smartphone Location Classification
Noa Cohen, Rotem Dror, Itzik Klein
TL;DR
The paper tackles the data scarcity challenge in inertial sensing for smartphone location recognition by introducing a diffusion-based data generation framework. It transforms time-series accelerometer data into images via delay embedding (AccSignal2Image), then trains a conditional Elucidated Diffusion Model (EDM) with SongUNet to generate class-conditioned embedded images, which are inverted back to time-domain signals. The method is evaluated on the RIDI dataset using both image- and signal-domain classifiers, with strong fidelity evidenced by a low Fréchet Inception Distance (1.22) and minimal classifier performance gaps between real and synthetic data. The findings suggest that high-quality synthetic inertial data can substantially reduce data collection requirements for robust SLR models and potentially extend to related domains in robotics and healthcare.
Abstract
Despite the crucial role of inertial measurements in motion tracking and navigation systems, the time-consuming and resource-intensive nature of collecting extensive inertial data has hindered the development of robust machine learning models in this field. In recent years, diffusion models have emerged as a revolutionary class of generative models, reshaping the landscape of artificial data generation. These models surpass generative adversarial networks and other state-of-the-art approaches to complex tasks. In this work, we propose diffusion-driven specific force-generated data for smartphone location recognition. We provide a comprehensive evaluation methodology by comparing synthetic and real recorded specific force data across multiple metrics. Our results demonstrate that our diffusion-based generative model successfully captures the distinctive characteristics of specific force signals across different smartphone placement conditions. Thus, by creating diverse, realistic synthetic data, we can reduce the burden of extensive data collection while providing high-quality training data for machine learning models.
