Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

Yushan Qian; Wei-Nan Zhang; Ting Liu

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

Yushan Qian, Wei-Nan Zhang, Ting Liu

TL;DR

The paper demonstrates that large language models can significantly improve empathetic response generation in dialogues, surpassing state-of-the-art baselines. It introduces three targeted improvements—semantically similar in-context learning, two-stage interactive generation, and knowledge-base augmentation using a commonsense graph via COMET—and validates them with extensive automatic and human evaluations. Additionally, the study explores GPT-4 as a surrogate evaluator, finding meaningful correlations with human judgments. The work advances practical empathetic dialogue systems and provides insights into efficient evaluation and knowledge integration for LLM-based generation.

Abstract

Empathetic dialogue is an indispensable part of building harmonious social relationships and contributes to the development of a helpful AI. Previous approaches are mainly based on fine small-scale language models. With the advent of ChatGPT, the application effect of large language models (LLMs) in this field has attracted great attention. This work empirically investigates the performance of LLMs in generating empathetic responses and proposes three improvement methods of semantically similar in-context learning, two-stage interactive generation, and combination with the knowledge base. Extensive experiments show that LLMs can significantly benefit from our proposed methods and is able to achieve state-of-the-art performance in both automatic and human evaluations. Additionally, we explore the possibility of GPT-4 simulating human evaluators.

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

TL;DR

Abstract

Paper Structure (28 sections, 9 equations, 2 figures, 8 tables)

This paper contains 28 sections, 9 equations, 2 figures, 8 tables.

Introduction
Related Work
Empathetic Response Generation
Large Language Models
Methodology
Overview
Preliminary Exploration
Advanced Exploration
Improvement via Semantically Similar In-Context Learning
Improvement via Two-stage Interactive Generation
Improvement via Knowledge Base
Experimental Setup
Dataset
Compared Models
Evaluation Metrics
...and 13 more sections

Figures (2)

Figure 1: An example of empathetic dialogue from the EmpatheticDialogues dataset.
Figure 2: The overall architecture and flow of our proposed methods for LLMs in empathetic dialogue generation.

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

TL;DR

Abstract

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

Authors

TL;DR

Abstract

Table of Contents

Figures (2)