Dialog Flow Induction for Constrainable LLM-Based Chatbots
Stuti Agrawal, Nishi Uppuluri, Pranav Pillai, Revanth Gangi Reddy, Zoey Li, Gokhan Tur, Dilek Hakkani-Tur, Heng Ji
TL;DR
The paper tackles the problem of constraining LLM-based chatbots to domain-specific conversations by automatically inducing domain-aware dialog flows. It introduces two unsupervised variants: intrinsic flows, built from GPT-4 knowledge with self-refinement, and data-guided flows, grounded in actual domain dialogs and merged with the intrinsic flow to maximize coverage. Through human and automatic evaluations across multiple task-oriented domains, the data-guided flow demonstrates improved domain coverage while preserving coherence and conclusiveness, illustrating the value of grounding flows in real conversations. The work provides a practical blueprint for integrating flow-controlled schemas into LLM chatbots and discusses limitations and avenues for future improvement in broader, open-ended domains.
Abstract
LLM-driven dialog systems are used in a diverse set of applications, ranging from healthcare to customer service. However, given their generalization capability, it is difficult to ensure that these chatbots stay within the boundaries of the specialized domains, potentially resulting in inaccurate information and irrelevant responses. This paper introduces an unsupervised approach for automatically inducing domain-specific dialog flows that can be used to constrain LLM-based chatbots. We introduce two variants of dialog flow based on the availability of in-domain conversation instances. Through human and automatic evaluation over various dialog domains, we demonstrate that our high-quality data-guided dialog flows achieve better domain coverage, thereby overcoming the need for extensive manual crafting of such flows.
