Alleviating Catastrophic Forgetting in Facial Expression Recognition with Emotion-Centered Models
Israel A. Laurensi, Alceu de Souza Britto, Jean Paul Barddal, Alessandro Lameiras Koerich
TL;DR
The paper tackles catastrophic forgetting in facial expression recognition by introducing emotion-centered generative replay (ECgr) that uses class-specific WGAN-GP generated images alongside a QA filter to retain past knowledge while learning new emotions. It couples ECgr with a weighted loss to account for the confidence of the CNN and demonstrates the approach across FER datasets, showing ECgr—especially when paired with QA—approaches joint training performance without accessing all past data. The contributions include a novel pseudo-rehearsal framework for FER, a QA-based filtering mechanism, and a structured offline/continual-learning pipeline with a loss formulation that incorporates image quality. The findings suggest this strategy effectively mitigates forgetting, offering a practical path for continual FER in dynamic settings, albeit with higher computational cost and sensitivity to weighting choices.
Abstract
Facial expression recognition is a pivotal component in machine learning, facilitating various applications. However, convolutional neural networks (CNNs) are often plagued by catastrophic forgetting, impeding their adaptability. The proposed method, emotion-centered generative replay (ECgr), tackles this challenge by integrating synthetic images from generative adversarial networks. Moreover, ECgr incorporates a quality assurance algorithm to ensure the fidelity of generated images. This dual approach enables CNNs to retain past knowledge while learning new tasks, enhancing their performance in emotion recognition. The experimental results on four diverse facial expression datasets demonstrate that incorporating images generated by our pseudo-rehearsal method enhances training on the targeted dataset and the source dataset while making the CNN retain previously learned knowledge.
