Hi-EF: Benchmarking Emotion Forecasting in Human-interaction
Haoran Wang, Xinji Mai, Zeng Tao, Junxiong Lin, Xuan Tong, Ivy Pan, Shaoqi Yan, Yan Wang, Shuyong Gao
TL;DR
This work reframes Affective Forecasting as Emotion Forecasting (EF) in two-party interactions and introduces the Hi-EF dataset with Multilayered-Contextual Interaction Samples (MCIS) to predict a partner's future emotion from short-term context and current states. A three-function EF paradigm—context fusion, current emotion recognition, and future emotion forecasting—underpins the dataset design, labeling, and baseline modeling, which together demonstrate the task's feasibility. Through comprehensive experiments across multimodal encoders and fusion strategies, the study shows the value of combining contextual information with intra- and inter-video fusion, highlighting practical implications for emotion-aware systems and interactive agents. The Hi-EF resource and EF framework lay groundwork for further research in affective computing beyond current emotion recognition toward predictive, interaction-aware emotion modeling.
Abstract
Affective Forecasting is an psychology task that involves predicting an individual's future emotional responses, often hampered by reliance on external factors leading to inaccuracies, and typically remains at a qualitative analysis stage. To address these challenges, we narrows the scope of Affective Forecasting by introducing the concept of Human-interaction-based Emotion Forecasting (EF). This task is set within the context of a two-party interaction, positing that an individual's emotions are significantly influenced by their interaction partner's emotional expressions and informational cues. This dynamic provides a structured perspective for exploring the patterns of emotional change, thereby enhancing the feasibility of emotion forecasting.
