Table of Contents
Fetching ...

Don't Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints

Nicolò Penzo, Marco Guerini, Bruno Lepri, Goran Glavaš, Sara Tonelli

TL;DR

The paper tackles privacy-driven concerns and the lack of structural diversity in multi-party conversations (MPCs) by examining constraint-guided synthetic MPC generation. It compares One-Long and Turn-by-Turn generation strategies across four instruction-based sources, using 38 polarized topics with paired stances and a comprehensive evaluation framework. Results show Turn-by-Turn generation improves constraint conformance and linguistic variability, with models like Qwen2.5 and Llama3.1 yielding high-quality, structurally rich MPCs that surpass a real UbuntuIRC corpus in interaction complexity. The work demonstrates the viability of scalable, constraint-aware MPC synthesis for dataset creation and downstream fine-tuning, while noting limitations in language, topic scope, and computation, and outlining directions for broader applicability.

Abstract

Multi-Party Conversations (MPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited due to rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., as a consequence of ``reply-to'' links). This work explores the feasibility of generating diverse MPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants' stance. We investigate two complementary strategies of leveraging LLMs in this context: (i.) LLMs as MPC generators, where we task the LLM to generate a whole MPC at once and (ii.) LLMs as MPC parties, where the LLM generates one turn of the conversation at a time, provided the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the quality of obtained MPCs via human annotation and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality MPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating MPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality MPCs.

Don't Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints

TL;DR

The paper tackles privacy-driven concerns and the lack of structural diversity in multi-party conversations (MPCs) by examining constraint-guided synthetic MPC generation. It compares One-Long and Turn-by-Turn generation strategies across four instruction-based sources, using 38 polarized topics with paired stances and a comprehensive evaluation framework. Results show Turn-by-Turn generation improves constraint conformance and linguistic variability, with models like Qwen2.5 and Llama3.1 yielding high-quality, structurally rich MPCs that surpass a real UbuntuIRC corpus in interaction complexity. The work demonstrates the viability of scalable, constraint-aware MPC synthesis for dataset creation and downstream fine-tuning, while noting limitations in language, topic scope, and computation, and outlining directions for broader applicability.

Abstract

Multi-Party Conversations (MPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited due to rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., as a consequence of ``reply-to'' links). This work explores the feasibility of generating diverse MPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants' stance. We investigate two complementary strategies of leveraging LLMs in this context: (i.) LLMs as MPC generators, where we task the LLM to generate a whole MPC at once and (ii.) LLMs as MPC parties, where the LLM generates one turn of the conversation at a time, provided the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the quality of obtained MPCs via human annotation and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality MPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating MPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality MPCs.

Paper Structure

This paper contains 27 sections, 13 figures, 7 tables.

Figures (13)

  • Figure 1: Example of a turn in a synthetic MPC.
  • Figure 2: Overview of the metrics considered in our structural analysis.
  • Figure 3: Empirical Cumulative Density Function (ECDF) of structural analysis of synthetic MPCs from Qwen2.5-TT. All the boxes goes from $0$ to $1$ on both vertical axis (density) and horizontal axis (value of the metric).
  • Figure 4: The two versions of Task Description for the One-Long generation strategy
  • Figure 5: The two examples of Output Format for the One-Long generation strategy
  • ...and 8 more figures