Emotional Supporters often Use Multiple Strategies in a Single Turn

Xin Bai, Guanyi Chen, Tingting He, Chenlian Zhou, Yu Liu

TL;DR

This work shows that emotional supporters often deploy multiple strategies within a single turn, a pattern overlooked by prior ESC task formulations. Based on an analysis of the ESConv dataset, the authors redefine the ESC task as generating the complete strategy–utterance sequence of a single supporter turn, and preprocess ESConv into variants for training models that explicitly model consecutive strategy usage. They evaluate three supervised approaches (MCC, MTL, CP) and prompt-based LLMs. Under the refined task, LLMs (notably GPT-4o and DeepSeek-R1) outperform both supervised models and human supporters in human evaluations, while CP is the strongest at reproducing exact strategy sequences (EMR and LR). These results challenge prior claims about LLM limitations in ESC and indicate that task formulation critically shapes model performance: LLMs show strong holistic support capabilities, including asking questions and offering concrete suggestions. The work introduces evaluation metrics (EMR, LR, ALD) tailored to multi-strategy generation and suggests future work on robust, multi-turn, expert-evaluated assessment.

Abstract

Emotional Support Conversations (ESC) are crucial for providing empathy, validation, and actionable guidance to individuals in distress. However, existing definitions of the ESC task oversimplify the structure of supportive responses, typically modelling them as single strategy-utterance pairs. Through a detailed corpus analysis of the ESConv dataset, we identify a common yet previously overlooked phenomenon: emotional supporters often employ multiple strategies consecutively within a single turn. We formally redefine the ESC task to account for this, proposing a revised formulation that requires generating the full sequence of strategy-utterance pairs given a dialogue history. To facilitate this refined task, we introduce several modelling approaches, including supervised deep learning models and large language models. Our experiments show that, under this redefined task, state-of-the-art LLMs outperform both supervised models and human supporters. Notably, contrary to some earlier findings, we observe that LLMs frequently ask questions and provide suggestions, demonstrating more holistic support capabilities.
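
To make the refined formulation concrete, the following is a minimal sketch of how consecutive supporter messages could be grouped into one target turn, so that each training example pairs a dialogue history with the full sequence of strategy-utterance pairs. The `Message` record, the `build_examples` helper, and the strategy labels in the demo are hypothetical and chosen for illustration; they are not the authors' preprocessing code or the actual ESConv schema.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Message:
    """One annotated message (hypothetical, simplified record)."""
    speaker: str        # "seeker" or "supporter"
    text: str
    strategy: str = ""  # only set for supporter messages


# A target turn is an ordered list of (strategy, utterance) pairs.
Turn = List[Tuple[str, str]]


def build_examples(dialogue: List[Message]) -> List[Tuple[List[Message], Turn]]:
    """Return one (history, target) example per supporter turn.

    Consecutive supporter messages are merged into a single target,
    so a turn that uses several strategies is kept whole.
    """
    examples = []
    history: List[Message] = []
    i = 0
    while i < len(dialogue):
        if dialogue[i].speaker == "supporter":
            # Extend j over the whole run of supporter messages.
            j = i
            target: Turn = []
            while j < len(dialogue) and dialogue[j].speaker == "supporter":
                target.append((dialogue[j].strategy, dialogue[j].text))
                j += 1
            examples.append((list(history), target))
            history.extend(dialogue[i:j])
            i = j
        else:
            history.append(dialogue[i])
            i += 1
    return examples


# Illustrative dialogue; the strategy names are assumptions.
demo = [
    Message("seeker", "I failed my exam and feel awful."),
    Message("supporter", "That sounds really hard.", "Reflection of feelings"),
    Message("supporter", "What subject was it?", "Question"),
    Message("seeker", "Statistics."),
    Message("supporter", "Maybe a study group could help.", "Providing Suggestions"),
]
examples = build_examples(demo)
```

Under the single-pair formulation, the first supporter turn in `demo` would yield only one strategy-utterance pair; here its target contains both the reflection and the question, in order.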

Paper Structure

This paper contains 34 sections, 6 equations, 5 figures, 6 tables.

Figures (5)

  • Figure 1: Example of an emotional support conversation. The part in the red window is the dialogue history, and the part in the blue window is the target that the ESC task asks an ESDS to produce. The figure illustrates three ESC task formulations: (1) emotional support utterance generation; (2) emotional support message continuation; (3) the refined ESC task.
  • Figure 2: The distribution of responses and the average response length based on the number of strategies employed. Note that the average length is 0 for responses with 6 strategies, as no such responses exist in the ESConv dataset.
  • Figure 3: Diagrams of our supervised approaches. These approaches model CUS as (1) multi-class classification, (2) multi-task learning, and (3) content planning.
  • Figure 4: The distribution of responses and the average response length for human, MCC, MTL, CP, GPT-4o, and DeepSeek-R1, based on the number of strategies employed.
  • Figure 5: The frequency of each strategy used by each LLM and Human.