UserBoost: Generating User-specific Synthetic Data for Faster Enrolment into Behavioural Biometric Systems
George Webber, Jack Sturgess, Ivan Martinovic
TL;DR
This work tackles the enrolment burden in behavioural biometric smartwatch authentication by generating user-specific synthetic IMU gestures with a regularised autoencoder and using them to train a lightweight classifier. The authors design a VAE-based generative model with explicit latent-space regularisation and a user-clustering objective, and they explore multiple latent-space sampling strategies to produce diverse, high-fidelity synthetic gestures. Evaluation on the WatchAuth dataset shows that synthetic data can reduce the number of real gestures required for enrolment by more than 40% while maintaining or improving performance metrics such as FAR@0 and AUROC, though gains vary by user. The approach offers a practical path to faster, privacy-preserving enrolment in resource-constrained wearable authentication and suggests avenues for broader application and integration with other generative methods.
Abstract
Behavioural biometric authentication systems entail an enrolment period that is burdensome for the user. In this work, we explore generating synthetic gestures from a few real user gestures with generative deep learning, with the application of training a simple (i.e. non-deep-learned) authentication model. Specifically, we show that utilising synthetic data alongside real data can reduce the number of real datapoints a user must provide to enrol into a biometric system. To validate our methods, we use the publicly available dataset of WatchAuth, a system proposed in 2022 for authenticating smartwatch payments using the physical gesture of reaching towards a payment terminal. We develop a regularised autoencoder model for generating synthetic user-specific wrist motion data representing these physical gestures, and demonstrate the diversity and fidelity of our synthetic gestures. We show that using synthetic gestures in training can improve classification ability for a real-world system. Through this technique we can reduce the number of gestures required to enrol a user into a WatchAuth-like system by more than 40% without negatively impacting its error rates.
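To make the idea of generating user-specific synthetic data from a few real gestures more concrete, the following is a minimal sketch of one possible latent-space sampling strategy: interpolating between a user's encoded gestures and adding small Gaussian noise. It is an illustration under stated assumptions, not the authors' method. The function name `sample_user_latents`, the `noise_scale` parameter, and the premise that a trained encoder has already mapped each real gesture to a latent vector are all hypothetical.

```python
import numpy as np


def sample_user_latents(real_latents, n_synthetic, noise_scale=0.05, seed=0):
    """Draw synthetic latent codes near one user's real latent codes.

    Assumption: real_latents is an (n_real, latent_dim) array produced by
    some trained encoder. This is an illustrative strategy only.
    """
    rng = np.random.default_rng(seed)
    real_latents = np.asarray(real_latents, dtype=float)
    n_real, latent_dim = real_latents.shape

    # Pick random pairs of the user's real codes and interpolate between them.
    i = rng.integers(0, n_real, size=n_synthetic)
    j = rng.integers(0, n_real, size=n_synthetic)
    t = rng.uniform(0.0, 1.0, size=(n_synthetic, 1))
    mixed = (1.0 - t) * real_latents[i] + t * real_latents[j]

    # Add small isotropic Gaussian noise for extra diversity.
    noise = noise_scale * rng.normal(size=(n_synthetic, latent_dim))
    return mixed + noise
```

In a full pipeline, the sampled codes would presumably be passed through the trained decoder to obtain synthetic gestures, which are then combined with the user's real gestures to train the authentication classifier. The choice of `noise_scale` trades diversity against fidelity and would need to be tuned empirically.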
