CMED: A Child Micro-Expression Dataset
Nikin~Matharaarachchi, Muhammad~Fermi Pasha, Sonya~Coleman, Kah PengWong
TL;DR
This work addresses the absence of child micro-expression data by introducing CMED, the first in-the-wild video dataset of spontaneous child micro-expressions collected during teletherapy-like Zoom sessions from 74 subjects. It details a full dataset creation pipeline—data collection, automated preprocessing, careful labelling with METT-based training, and a three-stage verification process—plus a 6-class and a 3-class variant (including No-ME). Baseline evaluations using LBP-TOP, VGG-16, and DGCNN show deep learning methods outperform handcrafted features, with DGCNN benefiting from body-landmark graphs and VGG-16 offering strong apex-frame-based performance for the 3-class variant. CMED’s real-world collection setting and comprehensive labeling enable robust analysis of child ME differences vs. adults and hold promise for improving computer-assisted therapy, especially in remote or rural contexts; access to the dataset is available upon request under a Data Access Agreement.
Abstract
Micro-expressions are short bursts of emotion that are difficult to hide. Their detection in children is an important cue to assist psychotherapists in conducting better therapy. However, existing research on the detection of micro-expressions has focused on adults, whose expressions differ in their characteristics from those of children. The lack of research is a direct consequence of the lack of a child-based micro-expressions dataset as it is much more challenging to capture children's facial expressions due to the lack of predictability and controllability. This study compiles a dataset of spontaneous child micro-expression videos, the first of its kind, to the best of the authors knowledge. The dataset is captured in the wild using video conferencing software. This dataset enables us to then explore key features and differences between adult and child micro-expressions. This study also establishes a baseline for the automated spotting and recognition of micro-expressions in children using three approaches comprising of hand-created and learning-based approaches.
