Adaptive Temporal Motion Guided Graph Convolution Network for Micro-expression Recognition
Fengyuan Zhang, Zhaopei Huang, Xinjie Zhang, Qin Jin
TL;DR
This paper addresses micro-expression recognition by explicitly modeling temporal dependencies across entire clips. It introduces Adaptive Temporal Motion Guided Graph Convolution Network (ATM-GCN), which combines Temporal Motion Pairing & Encoding with an Adaptive Temporal Motion Layer to fuse global and local motion information via a graph containing a Global Motion Node and Local Motion Nodes. The method uses adaptive edge weights, a forgetting-rate based adjacency update, and a Self-Attention classifier, achieving state-of-the-art results on CAS(ME)$^3$ and Composite datasets and competitive performance on SAMM and CASME II. The proposed framework improves MER performance by mitigating temporal redundancy and emphasizing temporally informative motions, with attention-based visualization supporting its ability to focus on critical temporal regions.
Abstract
Micro-expressions serve as essential cues for understanding individuals' genuine emotional states. Recognizing micro-expressions attracts increasing research attention due to its various applications in fields such as business negotiation and psychotherapy. However, the intricate and transient nature of micro-expressions poses a significant challenge to their accurate recognition. Most existing works either neglect temporal dependencies or suffer from redundancy issues in clip-level recognition. In this work, we propose a novel framework for micro-expression recognition, named the Adaptive Temporal Motion Guided Graph Convolution Network (ATM-GCN). Our framework excels at capturing temporal dependencies between frames across the entire clip, thereby enhancing micro-expression recognition at the clip level. Specifically, the integration of Adaptive Temporal Motion layers empowers our method to aggregate global and local motion features inherent in micro-expressions. Experimental results demonstrate that ATM-GCN not only surpasses existing state-of-the-art methods, particularly on the Composite dataset, but also achieves superior performance on the latest micro-expression dataset CAS(ME)$^3$.
