Modelling the Distribution of Human Motion for Sign Language Assessment
Oliver Cory, Ozge Mercanoglu Sincan, Matthew Vowels, Alessia Battisti, Franz Holzknecht, Katja Tissi, Sandra Sidler-Miserez, Tobias Haug, Sarah Ebling, Richard Bowden
TL;DR
This work addresses SLA for continuous Sign Language by modelling the natural distribution of human motion across multiple native signers. It introduces a SkeletonVAE to embed 3D skeletal poses, selects a representative reference per sentence, and builds a Motion Envelope with Gaussian Processes to quantify learner deviations. Across a Sentence Repetition Test dataset, the approach correlates with human ratings and enables spatio-temporal anomaly detection for targeted feedback. The method offers interpretable, probabilistic assessments and lays groundwork for extending to non-manual features in future SLA systems.
Abstract
Sign Language Assessment (SLA) tools are useful to aid in language learning and are underdeveloped. Previous work has focused on isolated signs or comparison against a single reference video to assess Sign Languages (SL). This paper introduces a novel SLA tool designed to evaluate the comprehensibility of SL by modelling the natural distribution of human motion. We train our pipeline on data from native signers and evaluate it using SL learners. We compare our results to ratings from a human raters study and find strong correlation between human ratings and our tool. We visually demonstrate our tools ability to detect anomalous results spatio-temporally, providing actionable feedback to aid in SL learning and assessment.
