Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective

Guimin Hu; Yi Xin; Weimin Lyu; Haojian Huang; Chang Sun; Zhihong Zhu; Lin Gui; Ruichu Cai; Erik Cambria; Hasti Seifi

TL;DR

This survey presents recent trends in multimodal affective computing from an NLP perspective through four hot tasks: multimodal sentiment analysis, multimodal emotion recognition in conversation, multimodal aspect-based sentiment analysis, and multimodal multi-label emotion recognition.

Abstract

Multimodal affective computing (MAC) has garnered increasing attention due to its broad applications in analyzing human behaviors and intentions, especially in the text-dominated multimodal affective computing field. This survey presents recent trends in multimodal affective computing from an NLP perspective through four hot tasks: multimodal sentiment analysis, multimodal emotion recognition in conversation, multimodal aspect-based sentiment analysis, and multimodal multi-label emotion recognition. The goal of this survey is to explore the current landscape of multimodal affective research, identify development trends, and highlight the similarities and differences across tasks, offering a comprehensive report on recent progress in multimodal affective computing from an NLP perspective. The survey covers the formalization of tasks, provides an overview of relevant works, describes benchmark datasets, and details the evaluation metrics for each task. It also briefly discusses research in multimodal affective computing involving facial expressions, acoustic signals, physiological signals, and emotion causes. Finally, we discuss the technical approaches, challenges, and future directions in multimodal affective computing. To support further research, we have released a repository that compiles related works in multimodal affective computing, providing detailed resources and references for the community.

Paper Structure (77 sections, 1 equation, 8 figures, 3 tables)

Introduction
Organization of this Survey
Multimodal Affective Computing Tasks
Multimodal Sentiment Analysis
Task Formalization
Application Scenarios
Multimodal Emotion Recognition in Conversation
Task Formalization
Application Scenarios
Multimodal Aspect-based Sentiment Analysis
Task Formalization
Application Scenarios
Multimodal Multi-label Emotion Recognition
Task Formalization
Application Scenarios
...and 62 more sections

Figures (8)

Figure 1: Taxonomy of multimodal affective computing from the perspectives of multimodal fusion and multimodal alignment.
Figure 2: Illustration of multimodal fusion from the following aspects: 1) cross-modality fusion, 2) modal fusion based on modal consistency and difference, and 3) multi-stage modal fusion.
Figure 3: Illustration of multimodal alignment: (a) semantic alignment and (b) alignment with missing modal fragments.
Figure 4: Taxonomy of multimodal affective computing works from the aspects of multitask learning, pre-trained models, enhanced knowledge, and contextual information.
Figure 5: Illustration of multitask learning in multimodal affective computing tasks.
...and 3 more figures
