Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication
Weicheng Ma, Hefan Zhang, Ivory Yang, Shiyu Ji, Joice Chen, Farnoosh Hashemi, Shubham Mohole, Ethan Gearey, Michael Macy, Saeed Hassanpour, Soroush Vosoughi
TL;DR
This work tackles the scarcity and quality concerns of persuasion data by introducing a fully automated, multi-agent framework that generates and annotates persuasive dialogues. It combines six agent roles (dialogue generation, utterance quality monitoring, language refinement, persuasiveness annotation, global regulation, and postprocessing) across a GPT-3.5/GPT-4 backbone, incorporating NormBank taboos for ethically challenging scenarios and continuous, sentence-level perspective labeling. The approach demonstrates high-quality outputs with strong naturalness, coherent dialogic flow, and diverse persuasion strategies, while enabling strategy-controlled and multi-party dialogue generation. Validation includes extensive human-in-the-loop evaluations and model-assisted analyses, underscoring the framework’s scalability, generalizability, and potential to advance research in computational and social science domains, with careful attention to ethical use and misinformation risks. The resulting resource and methodology offer a practical pathway to study persuasive communication at scale and across complex contexts.
Abstract
Large Language Models (LLMs) have shown proficiency in generating persuasive dialogue, yet concerns about the fluency and sophistication of their outputs persist. This paper presents a multi-LLM communication framework designed to enhance the generation of persuasive data automatically. This framework facilitates the efficient production of high-quality, diverse linguistic content with minimal human oversight. Through extensive evaluations, we demonstrate that the generated data excels in naturalness, linguistic diversity, and the strategic use of persuasion, even in complex scenarios involving social taboos. The framework also proves adept at generalizing across novel contexts. Our results highlight the framework's potential to significantly advance research in both computational and social science domains concerning persuasive communication.
