Subject-Based Domain Adaptation for Facial Expression Recognition
Muhammad Osama Zeeshan, Muhammad Haseeb Aslam, Soufiane Belharbi, Alessandro Lameiras Koerich, Marco Pedersoli, Simon Bacon, Eric Granger
TL;DR
The paper tackles FER personalization by treating each subject as a distinct domain and applying multi-source domain adaptation to transfer learning from multiple labeled sources to an unlabeled target. It introduces a two-stage framework: (1) align source subjects using a between-subject discrepancy loss based on maximum mean discrepancy (MMD) and supervised loss; (2) generate reliable target pseudo-labels via Augmented Confident Pseudo Labels (ACPL) using augmentation-thresholding, followed by joint source-target adaptation with L_ce and L_mmd terms. Empirical results on BioVid and UNBC-McMaster show that subject-based MSDA with ACPL outperforms standard UDA and existing MSDA baselines, especially when selecting the most relevant source subjects, and scales well to a large number of sources. These findings demonstrate a practical path toward scalable, subject-aware FER systems with strong domain generalization and personalization capabilities.
Abstract
Adapting a deep learning model to a specific target individual is a challenging facial expression recognition (FER) task that may be achieved using unsupervised domain adaptation (UDA) methods. Although several UDA methods have been proposed to adapt deep FER models across source and target data sets, multiple subject-specific source domains are needed to accurately represent the intra- and inter-person variability in subject-based adaption. This paper considers the setting where domains correspond to individuals, not entire datasets. Unlike UDA, multi-source domain adaptation (MSDA) methods can leverage multiple source datasets to improve the accuracy and robustness of the target model. However, previous methods for MSDA adapt image classification models across datasets and do not scale well to a more significant number of source domains. This paper introduces a new MSDA method for subject-based domain adaptation in FER. It efficiently leverages information from multiple source subjects (labeled source domain data) to adapt a deep FER model to a single target individual (unlabeled target domain data). During adaptation, our subject-based MSDA first computes a between-source discrepancy loss to mitigate the domain shift among data from several source subjects. Then, a new strategy is employed to generate augmented confident pseudo-labels for the target subject, allowing a reduction in the domain shift between source and target subjects. Experiments performed on the challenging BioVid heat and pain dataset with 87 subjects and the UNBC-McMaster shoulder pain dataset with 25 subjects show that our subject-based MSDA can outperform state-of-the-art methods yet scale well to multiple subject-based source domains.
