Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations
Hao Yang, Hongyuan Lu, Xinhua Zeng, Yang Liu, Xiang Zhang, Haoran Yang, Yumeng Zhang, Shan Huang, Yiran Wei, Wai Lam
TL;DR
The paper addresses the rigidity of single-step dialogue systems by introducing Stephanie, a Step-by-Step Dialogue Paradigm that mimics the ongoing flow of human conversation. It presents a dual learning prompt framework that conditions the response $r$ on background information $D$, positive objectives $P$, and negative objectives $N$, optimizing $p(r \mid D, P, N)$, together with a Further-Split post-editing method that produces more natural, multi-turn exchanges. Using PERSONA-CHAT as a base, the authors generate an incremental dataset of 5,457 dialogues with Llama3-70b and demonstrate a plug-and-play fine-tuning approach for creating Stephanie-enabled LLMs. In both automatic and human evaluations, step-by-step dialogues outperform single-step baselines on engagement, naturalness, and diversity and read as more human-like, with Stephanie achieving the strongest results. The work releases code, Stephanie datasets, and Stephanie LLMs to support research on, and practical deployment of, more natural chatbot systems.
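To make the conditioning in $p(r \mid D, P, N)$ concrete, the following is a minimal sketch of how the three prompt parts might be assembled into a single prompt string. It is an illustration only, not the authors' prompt template: the class name `DualLearningPrompt`, its fields, the `render` layout, and the example objectives are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class DualLearningPrompt:
    """Hypothetical container for the three prompt parts named in the summary."""

    background: str                                      # D: background information
    positives: list[str] = field(default_factory=list)  # P: behaviours to encourage
    negatives: list[str] = field(default_factory=list)  # N: behaviours to avoid

    def render(self) -> str:
        """Join D, P, and N into one prompt string that conditions the response r."""
        lines = ["Background:", self.background, "", "Do:"]
        lines += [f"- {p}" for p in self.positives]
        lines += ["", "Do not:"]
        lines += [f"- {n}" for n in self.negatives]
        return "\n".join(lines)


# Illustrative values only; none of these strings come from the paper.
prompt = DualLearningPrompt(
    background="Two friends are chatting about weekend plans.",
    positives=["Reply in several short consecutive messages."],
    negatives=["Pack the whole reply into one long message."],
)
text = prompt.render()
print(text)
```

Keeping the positive and negative objectives as separate lists mirrors the split between $P$ and $N$ in the summary; how the paper actually words or orders these parts is not specified here.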
Abstract
In the rapidly evolving field of natural language processing, dialogue systems primarily employ a single-step dialogue paradigm. Although this paradigm is efficient, it lacks the depth and fluidity of human interactions and does not appear natural. We introduce a novel Step-by-Step Dialogue Paradigm (Stephanie), designed to mimic the ongoing, dynamic nature of human conversations. Using a dual learning strategy and a further-split post-editing method, we generate a high-quality step-by-step dialogue dataset and use it to fine-tune existing large language models, enabling them to perform step-by-step dialogues. We describe Stephanie in detail and conduct tailored automatic and human evaluations to assess its effectiveness against the traditional single-step dialogue paradigm. We will release code, Stephanie datasets, and Stephanie LLMs to facilitate future work on chatbots.
