KVC-onGoing: Keystroke Verification Challenge
Giuseppe Stragapede, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales, Ivan DeAndres-Tame, Naser Damer, Julian Fierrez, Javier Ortega-Garcia, Alejandro Acien, Nahuel Gonzalez, Andrei Shadrikov, Dmitrii Gordin, Leon Schmitt, Daniel Wimmer, Christoph Großmann, Joerdis Krieger, Florian Heinz, Ron Krestel, Christoffer Mayer, Simon Haberl, Helena Gschrey, Yosuke Yamagishi, Sanjay Saha, Sanka Rasnayaka, Sandareka Wickramanayake, Terence Sim, Weronika Gutfeter, Adam Baran, Mateusz Krzysztoń, Przemysław Jaskóła
TL;DR
KVC-onGoing introduces a large-scale, open benchmark for keystroke dynamics-based biometric verification, unifying data from the Aalto Desktop and Mobile Keystroke Databases under a standardized experimental protocol hosted on CodaLab. The framework supports desktop and mobile tasks with open-set learning, extensive enrolment/verification schedules, and dual evaluation modes to assess global and per-subject performance, paired with a comprehensive fairness analysis. Across diverse architectures (RNNs, CNNs, Transformers) and losses (SetMargin, ArcFace, triplet, contrastive), the study identifies LSIA and VeriKVC as leading solutions, achieving $EER$ as low as $3.33$% (desktop) and $3.61$% (mobile), demonstrating the strong discriminative power of keystroke dynamics. The work also highlights modest demographic biases in KD and emphasizes the need for further investigation into fairness across age and gender, suggesting future directions in architecture optimization and richer demographic analyses to strengthen practical deployment and privacy considerations.
Abstract
This article presents the Keystroke Verification Challenge - onGoing (KVC-onGoing), on which researchers can easily benchmark their systems in a common platform using large-scale public databases, the Aalto University Keystroke databases, and a standard experimental protocol. The keystroke data consist of tweet-long sequences of variable transcript text from over 185,000 subjects, acquired through desktop and mobile keyboards simulating real-life conditions. The results on the evaluation set of KVC-onGoing have proved the high discriminative power of keystroke dynamics, reaching values as low as 3.33% of Equal Error Rate (EER) and 11.96% of False Non-Match Rate (FNMR) @1% False Match Rate (FMR) in the desktop scenario, and 3.61% of EER and 17.44% of FNMR @1% at FMR in the mobile scenario, significantly improving previous state-of-the-art results. Concerning demographic fairness, the analyzed scores reflect the subjects' age and gender to various extents, not negligible in a few cases. The framework runs on CodaLab.
