Time Series Similarity Score Functions to Monitor and Interact with the Training and Denoising Process of a Time Series Diffusion Model applied to a Human Activity Recognition Dataset based on IMUs
Heiko Oppel, Andreas Spilz, Michael Munz
TL;DR
The paper addresses the challenge of assessing DDPM-generated time-series data quality, which is not well captured by standard loss functions. It proposes PSD-based similarity metrics and a class-optimized global alignment kernel (C-Opt GAK) that are integrated into both the training and denoising phases of an IMU-based time-series diffusion model (IMUDiffusion) to guide early stopping. The study demonstrates that similarity-guided training reduces training epochs by ~20% and can improve downstream HAR classifier performance, with notable gains for several participants in LOSOCV, while denoising-guided stopping offers additional but variable benefits. These findings suggest a practical pathway to more efficient, data-efficient generative modeling for wearable-sensor time series and have potential applicability to broader diffusion-based sequence generation tasks.
Abstract
Denoising diffusion probabilistic models are able to generate synthetic sensor signals. The training process of such a model is controlled by a loss function which measures the difference between the noise that was added in the forward process and the noise that was predicted by the diffusion model. This enables the generation of realistic data. However, the randomness within the process and the loss function itself makes it difficult to estimate the quality of the data. Therefore, we examine multiple similarity metrics and adapt an existing metric to overcome this issue by monitoring the training and synthetisation process using those metrics. The adapted metric can even be fine-tuned on the input data to comply with the requirements of an underlying classification task. We were able to significantly reduce the amount of training epochs without a performance reduction in the classification task. An optimized training process not only saves resources, but also reduces the time for training generative models.
