Face-GPS: A Comprehensive Technique for Quantifying Facial Muscle Dynamics in Videos

Juni Kim; Zhikang Dong; Pawel Polak

Face-GPS: A Comprehensive Technique for Quantifying Facial Muscle Dynamics in Videos

Juni Kim, Zhikang Dong, Pawel Polak

TL;DR

Face-GPS targets the challenge of quantifying facial muscle dynamics from widely available video by creating canonical face representations that are robust to head motion, then measuring muscle activity with DISC-inspired optical flow. The approach couples spectral and wavelet smoothing with a novel Multiple Kernel Smoothing (MKS) framework that fuses multiple facial descriptors via Gaussian radial basis kernels, weighted by a FAN based expression model, yielding smoothed displacement signals $r''_j$ for each facial region. Empirically, the method achieves strong emotion-related muscle movement identification on CK+ and delivers competitive classification performance (85% average accuracy, 86.1% with FAN) using only muscle displacements, underscoring interpretability and potential clinical utility. The work offers a non-invasive, explainable alternative to deep models with applications in remote diagnosis, plastic surgery planning, and security, while acknowledging challenges in accurately modeling boundary regions of the facial manifold.

Abstract

We introduce a novel method that combines differential geometry, kernels smoothing, and spectral analysis to quantify facial muscle activity from widely accessible video recordings, such as those captured on personal smartphones. Our approach emphasizes practicality and accessibility. It has significant potential for applications in national security and plastic surgery. Additionally, it offers remote diagnosis and monitoring for medical conditions such as stroke, Bell's palsy, and acoustic neuroma. Moreover, it is adept at detecting and classifying emotions, from the overt to the subtle. The proposed face muscle analysis technique is an explainable alternative to deep learning methods and a non-invasive substitute to facial electromyography (fEMG).

Face-GPS: A Comprehensive Technique for Quantifying Facial Muscle Dynamics in Videos

TL;DR

for each facial region. Empirically, the method achieves strong emotion-related muscle movement identification on CK+ and delivers competitive classification performance (85% average accuracy, 86.1% with FAN) using only muscle displacements, underscoring interpretability and potential clinical utility. The work offers a non-invasive, explainable alternative to deep models with applications in remote diagnosis, plastic surgery planning, and security, while acknowledging challenges in accurately modeling boundary regions of the facial manifold.

Abstract

Paper Structure (5 sections, 1 equation, 1 figure, 1 table)

This paper contains 5 sections, 1 equation, 1 figure, 1 table.

Introduction
Methodology
Empirical Results
Conclusions
Potential negative societal impact

Figures (1)

Figure 1: Our method applied on videos exhibiting Happy, Fear, Disgust, Surprise, Sad, Anger, and Contempt from left to right.

Face-GPS: A Comprehensive Technique for Quantifying Facial Muscle Dynamics in Videos

TL;DR

Abstract

Face-GPS: A Comprehensive Technique for Quantifying Facial Muscle Dynamics in Videos

Authors

TL;DR

Abstract

Table of Contents

Figures (1)