Meta-Auxiliary Learning for Micro-Expression Recognition
Jingyao Wang, Yunhan Tian, Yuxuan Yang, Xiaoxin Chen, Changwen Zheng, Wenwen Qiang
TL;DR
LightmanNet tackles micro-expression recognition under real-world constraints by deploying a dual-branch meta-auxiliary learning framework. It combines a primary MER branch with an auxiliary image-alignment branch, trained via bi-level optimization to first learn task-specific knowledge and then distill general MER knowledge across tasks. The auxiliary branch leverages macro-expression similarities to guide discriminative feature learning, reducing reliance on large labeled ME datasets. Across five benchmark datasets and robustness tests (including few-shot and noisy data), LightmanNet achieves state-of-the-art performance with favorable efficiency, indicating strong practical potential for real-world MER applications.
Abstract
Micro-expressions (MEs) are involuntary movements revealing people's hidden feelings, which has attracted numerous interests for its objectivity in emotion detection. However, despite its wide applications in various scenarios, micro-expression recognition (MER) remains a challenging problem in real life due to three reasons, including (i) data-level: lack of data and imbalanced classes, (ii) feature-level: subtle, rapid changing, and complex features of MEs, and (iii) decision-making-level: impact of individual differences. To address these issues, we propose a dual-branch meta-auxiliary learning method, called LightmanNet, for fast and robust micro-expression recognition. Specifically, LightmanNet learns general MER knowledge from limited data through a dual-branch bi-level optimization process: (i) In the first level, it obtains task-specific MER knowledge by learning in two branches, where the first branch is for learning MER features via primary MER tasks, while the other branch is for guiding the model obtain discriminative features via auxiliary tasks, i.e., image alignment between micro-expressions and macro-expressions since their resemblance in both spatial and temporal behavioral patterns. The two branches of learning jointly constrain the model of learning meaningful task-specific MER knowledge while avoiding learning noise or superficial connections between MEs and emotions that may damage its generalization ability. (ii) In the second level, LightmanNet further refines the learned task-specific knowledge, improving model generalization and efficiency. Extensive experiments on various benchmark datasets demonstrate the superior robustness and efficiency of LightmanNet.
