Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

Ziqiao Shang; Li Yu

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

Ziqiao Shang, Li Yu

TL;DR

A new AU detection framework where multi-task learning is introduced to jointly learn AU domain separation and reconstruction and facial landmark detection by sharing the parameters of homostructural facial extraction modules is proposed.

Abstract

Recently how to introduce large amounts of unlabeled facial images in the wild into supervised Facial Action Unit (AU) detection frameworks has become a challenging problem. In this paper, we propose a new AU detection framework where multi-task learning is introduced to jointly learn AU domain separation and reconstruction and facial landmark detection by sharing the parameters of homostructural facial extraction modules. In addition, we propose a new feature alignment scheme based on contrastive learning by simple projectors and an improved contrastive loss, which adds four additional intermediate supervisors to promote the feature reconstruction process. Experimental results on two benchmarks demonstrate our superiority against the state-of-the-art methods for AU detection in the wild.

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

TL;DR

Abstract

Paper Structure (15 sections, 3 equations, 1 figure, 3 tables)

This paper contains 15 sections, 3 equations, 1 figure, 3 tables.

Introduction
Our Proposed Method
Overview
Multi-task learning training strategy
AU Domain Feature Separation and Reconstruction
Feature Alignment scheme and AU detection
Experiments
Experiment Settings
Datasets
Implementation Details
Comparison with State-of-the-Art Methods
AU Detection in Target Domain
AU Detection in Source Domain
Ablation Study
Conclusion

Figures (1)

Figure 1: Overview of our framework, where the same features and modules that have the same structure are marked by the same color. $E_{f}$, $G_{b}$, $E_{l}$, $G_{st}$, $D_{l}$, $E_{au}$ and $P_{ij}$ are shared by source-domain and target-domain input images.

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

TL;DR

Abstract

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

Authors

TL;DR

Abstract

Table of Contents

Figures (1)