Trajectory-guided Motion Perception for Facial Expression Quality Assessment in Neurological Disorders
Shuchao Duan, Amirhossein Dadashzadeh, Alan Whone, Majid Mirmehdi
TL;DR
This paper tackles automated FEQA for neurological disorders by proposing Trajectory-guided Motion Perception Transformer (TraMP-Former), a dual-stream architecture that fuses 2D facial landmark trajectories (processed by SkateFormer) with RGB frame semantics (via Former-DFER). The cross-modal TraMP fusion blocks update only the RGB stream while leveraging trajectory-derived motion as keys/values, enabling fine-grained motion capture. Evaluations on PFED5 and an augmented Toronto NeuroFace dataset show state-of-the-art performance, with average Spearman correlations of $ ho$ = 71.86% and 53.84%, respectively, illustrating the benefit of incorporating landmark trajectories for nuanced expression quality assessment. Ablation studies confirm the importance of trajectory representation, fusion strategy, and temporal-length choices, supporting the method’s robustness and potential clinical impact. The work suggests extending to 3D landmarks to further mitigate head-rotation noise and enhance trajectory-based motion perception in FEQA.
Abstract
Automated facial expression quality assessment (FEQA) in neurological disorders is critical for enhancing diagnostic accuracy and improving patient care, yet effectively capturing the subtle motions and nuances of facial muscle movements remains a challenge. We propose to analyse facial landmark trajectories, a compact yet informative representation, that encodes these subtle motions from a high-level structural perspective. Hence, we introduce Trajectory-guided Motion Perception Transformer (TraMP-Former), a novel FEQA framework that fuses landmark trajectory features for fine-grained motion capture with visual semantic cues from RGB frames, ultimately regressing the combined features into a quality score. Extensive experiments demonstrate that TraMP-Former achieves new state-of-the-art performance on benchmark datasets with neurological disorders, including PFED5 (up by 6.51%) and an augmented Toronto NeuroFace (up by 7.62%). Our ablation studies further validate the efficiency and effectiveness of landmark trajectories in FEQA. Our code is available at https://github.com/shuchaoduan/TraMP-Former.
