Table of Contents
Fetching ...

AnxietyFaceTrack: A Smartphone-Based Non-Intrusive Approach for Detecting Social Anxiety Using Facial Features

Nilesh Kumar Sahu, Snehil Gupta, Haroon R Lone

TL;DR

Social Anxiety Disorder lacks objective biomarkers, hindering early detection. AnxietyFaceTrack presents a smartphone-based, non-intrusive approach that analyzes facial cues during naturalistic social interactions to classify anxiety states. Using 669 OpenFace facial features from 2-minute interaction videos of 85–91 participants and a Random Forest classifier, the study reports multiclass accuracy around 91% and binary accuracies around 92–93%, with 3D landmarks and head pose driving performance. SHAP analysis provides interpretability, revealing informative facial cues, while bias analyses indicate gender- and education-related differences that warrant fairness considerations. Overall, the work demonstrates the feasibility of continuous, low-cost anxiety monitoring with potential real-world impact, subject to validation in uncontrolled settings and further bias mitigation.

Abstract

Social Anxiety Disorder (SAD) is a widespread mental health condition, yet its lack of objective markers hinders timely detection and intervention. While previous research has focused on behavioral and non-verbal markers of SAD in structured activities (e.g., speeches or interviews), these settings fail to replicate real-world, unstructured social interactions fully. Identifying non-verbal markers in naturalistic, unstaged environments is essential for developing ubiquitous and non-intrusive monitoring solutions. To address this gap, we present AnxietyFaceTrack, a study leveraging facial video analysis to detect anxiety in unstaged social settings. A cohort of 91 participants engaged in a social setting with unfamiliar individuals and their facial videos were recorded using a low-cost smartphone camera. We examined facial features, including eye movements, head position, facial landmarks, and facial action units, and used self-reported survey data to establish ground truth for multiclass (anxious, neutral, non-anxious) and binary (e.g., anxious vs. neutral) classifications. Our results demonstrate that a Random Forest classifier trained on the top 20% of features achieved the highest accuracy of 91.0% for multiclass classification and an average accuracy of 92.33% across binary classifications. Notably, head position and facial landmarks yielded the best performance for individual facial regions, achieving 85.0% and 88.0% accuracy, respectively, in multiclass classification, and 89.66% and 91.0% accuracy, respectively, across binary classifications. This study introduces a non-intrusive, cost-effective solution that can be seamlessly integrated into everyday smartphones for continuous anxiety monitoring, offering a promising pathway for early detection and intervention.

AnxietyFaceTrack: A Smartphone-Based Non-Intrusive Approach for Detecting Social Anxiety Using Facial Features

TL;DR

Social Anxiety Disorder lacks objective biomarkers, hindering early detection. AnxietyFaceTrack presents a smartphone-based, non-intrusive approach that analyzes facial cues during naturalistic social interactions to classify anxiety states. Using 669 OpenFace facial features from 2-minute interaction videos of 85–91 participants and a Random Forest classifier, the study reports multiclass accuracy around 91% and binary accuracies around 92–93%, with 3D landmarks and head pose driving performance. SHAP analysis provides interpretability, revealing informative facial cues, while bias analyses indicate gender- and education-related differences that warrant fairness considerations. Overall, the work demonstrates the feasibility of continuous, low-cost anxiety monitoring with potential real-world impact, subject to validation in uncontrolled settings and further bias mitigation.

Abstract

Social Anxiety Disorder (SAD) is a widespread mental health condition, yet its lack of objective markers hinders timely detection and intervention. While previous research has focused on behavioral and non-verbal markers of SAD in structured activities (e.g., speeches or interviews), these settings fail to replicate real-world, unstructured social interactions fully. Identifying non-verbal markers in naturalistic, unstaged environments is essential for developing ubiquitous and non-intrusive monitoring solutions. To address this gap, we present AnxietyFaceTrack, a study leveraging facial video analysis to detect anxiety in unstaged social settings. A cohort of 91 participants engaged in a social setting with unfamiliar individuals and their facial videos were recorded using a low-cost smartphone camera. We examined facial features, including eye movements, head position, facial landmarks, and facial action units, and used self-reported survey data to establish ground truth for multiclass (anxious, neutral, non-anxious) and binary (e.g., anxious vs. neutral) classifications. Our results demonstrate that a Random Forest classifier trained on the top 20% of features achieved the highest accuracy of 91.0% for multiclass classification and an average accuracy of 92.33% across binary classifications. Notably, head position and facial landmarks yielded the best performance for individual facial regions, achieving 85.0% and 88.0% accuracy, respectively, in multiclass classification, and 89.66% and 91.0% accuracy, respectively, across binary classifications. This study introduces a non-intrusive, cost-effective solution that can be seamlessly integrated into everyday smartphones for continuous anxiety monitoring, offering a promising pathway for early detection and intervention.

Paper Structure

This paper contains 28 sections, 7 figures, 6 tables.

Figures (7)

  • Figure 1: The study setup shows participants' sitting positions and camera positions. RA refers to Research Assistant, and labels P1, P2, and P3 refer to participants 1, 2, and 3, respectively. Labels C1, C2, and C3 refer to smartphone cameras 1, 2, and 3, respectively.
  • Figure 2: Distribution of participants' self-reported anxiety
  • Figure 3: Top ten important features for multiclass classification. Face edge (Y_1, Y_4, Y_11, Y_15, Y_16, Z_39), right eye (eye_lmk_Y_43, gaze_1_y, gaze_1_z), and head position angle (pose_Rx). Best viewed in color.
  • Figure 4: Influence of top ten features from Figure \ref{['fig:3_class_shape_summary']} on each class in Multiclass classification.
  • Figure 5: Influence of top ten features on individual classes in Multiclass classification.
  • ...and 2 more figures