Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues
KuanChao Chu, Yi-Pei Chen, Hideki Nakayama
TL;DR
The paper tackles pervasive quality degradation in multi-agent dialogues powered by LLMs, identifying repetition, inconsistency, and hallucination as core, time-propagating issues. It introduces the Screening, Diagnosis, and Regeneration (SDR) framework, which uses evidence gathering from past dialogues, a Natural Language Inference-Graph (NLI-G) for inconsistency, and iterative regeneration to produce more diverse, consistent, and factual conversations. Through experiments on the OneDayLife dataset, SDR demonstrates superior corpus-level diversity, factualness, consistency, and fluency, while reducing repetitive keyword usage and preserving dialogue integrity over time. The work provides a scalable, on-the-fly correction approach that sets a new standard for dialogue quality in open-domain multi-agent simulations and informs future research on robust, long-horizon agent interactions.
Abstract
This paper investigates the quality of multi-agent dialogues in simulations powered by Large Language Models (LLMs). Analyzing dialogues and memory over multiple sessions revealed significant issues such as repetition, inconsistency, and hallucination, exacerbated by the propagation of erroneous information. To combat these challenges, we propose a novel Screening, Diagnosis, and Regeneration (SDR) framework that detects and corrects utterance errors through a comprehensive process involving immediate issue identification, evidence gathering from past dialogues, and LLM analysis for utterance revision. By incorporating our SDR framework to Generative Agents (Park et al., 2023), we enhance the diversity, consistency, and factualness of the generated dialogues. This work presents a pioneering approach to enhancing dialogue quality in multi-agent simulations, establishing a new standard for future research in the field.
