Table of Contents
Fetching ...

StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

Jinpeng Li, Zekai Zhang, Quan Tu, Xin Cheng, Dongyan Zhao, Rui Yan

TL;DR

Experimental results show that the proposed framework StyleChat outperforms all the baselines and helps to break the style boundary of LLMs, which has been carefully constructed with rigorous human-led quality control.

Abstract

Large Language Models (LLMs) demonstrate superior performance in generative scenarios and have attracted widespread attention. Among them, stylized dialogue generation is essential in the context of LLMs for building intelligent and engaging dialogue agent. However the ability of LLMs is data-driven and limited by data bias, leading to poor performance on specific tasks. In particular, stylized dialogue generation suffers from a severe lack of supervised data. Furthermore, although many prompt-based methods have been proposed to accomplish specific tasks, their performance in complex real-world scenarios involving a wide variety of dialog styles further enhancement. In this work, we first introduce a stylized dialogue dataset StyleEval with 38 styles by leveraging the generative power of LLMs comprehensively, which has been carefully constructed with rigorous human-led quality control. Based on this, we propose the stylized dialogue framework StyleChat via recitation-augmented memory strategy and multi-task style learning strategy to promote generalization ability. To evaluate the effectiveness of our approach, we created a test benchmark that included both a generation task and a choice task to comprehensively evaluate trained models and assess whether styles and preferences are remembered and understood. Experimental results show that our proposed framework StyleChat outperforms all the baselines and helps to break the style boundary of LLMs.

StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

TL;DR

Experimental results show that the proposed framework StyleChat outperforms all the baselines and helps to break the style boundary of LLMs, which has been carefully constructed with rigorous human-led quality control.

Abstract

Large Language Models (LLMs) demonstrate superior performance in generative scenarios and have attracted widespread attention. Among them, stylized dialogue generation is essential in the context of LLMs for building intelligent and engaging dialogue agent. However the ability of LLMs is data-driven and limited by data bias, leading to poor performance on specific tasks. In particular, stylized dialogue generation suffers from a severe lack of supervised data. Furthermore, although many prompt-based methods have been proposed to accomplish specific tasks, their performance in complex real-world scenarios involving a wide variety of dialog styles further enhancement. In this work, we first introduce a stylized dialogue dataset StyleEval with 38 styles by leveraging the generative power of LLMs comprehensively, which has been carefully constructed with rigorous human-led quality control. Based on this, we propose the stylized dialogue framework StyleChat via recitation-augmented memory strategy and multi-task style learning strategy to promote generalization ability. To evaluate the effectiveness of our approach, we created a test benchmark that included both a generation task and a choice task to comprehensively evaluate trained models and assess whether styles and preferences are remembered and understood. Experimental results show that our proposed framework StyleChat outperforms all the baselines and helps to break the style boundary of LLMs.
Paper Structure (20 sections, 4 equations, 7 figures, 15 tables)

This paper contains 20 sections, 4 equations, 7 figures, 15 tables.

Figures (7)

  • Figure 1: Examples of stylized dialogue generation by different LLMs. The progress bar represents the quality in the particular style.
  • Figure 2: The workflow for developing StyleEval. LLM is employed to generate statistical-level style profile for a certain style. Then to extract linguistic-level style profile based on examples and linguistic knowledge. Finally, we produce stylized dialogue with context and style profile, verified by human to guarantee quality.
  • Figure 3: The overview of our proposed StyleChat Framework. During training phase, our model is instructed to first recite the style profile then respond with reference to the recited style profile. During Inference, model recalls or derives profiles from parametric memory and then respond with style. Our two-stage framework teaches model to learn implicit Chain of Thought process, resulting in better generalization abilities through chains of style thoughts.
  • Figure 4: The multiple choice evaluation, where y-axis represents accuracy and x-axis lists different models.
  • Figure 5: The evaluation of stylized dialogue generation of LLMs.
  • ...and 2 more figures