Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints

Authors

  • Nicolò Penzo Fondazione Bruno Kessler, Italy University of Trento, Italy
  • Marco Guerini Fondazione Bruno Kessler, Italy
  • Bruno Lepri Fondazione Bruno Kessler, Italy
  • Goran Glavaš Center For Artificial Intelligence and Data Science, University of Würzburg, Germany
  • Sara Tonelli Fondazione Bruno Kessler, Italy

DOI:

https://doi.org/10.1609/aaai.v40i39.40548

Abstract

Written Multi-Party Conversations (WMPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited due to rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., one-to-one ``reply-to'' links). This work explores the feasibility of generating synthetic WMPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants’ stance. We investigate two complementary strategies of leveraging LLMs in this context: (i.) LLMs as WMPC generators, where we task the LLM to generate a whole WMPC at once and (ii.) LLMs as WMPC parties, where the LLM generates one turn of the conversation at a time (made of speaker, addressee and message), provided the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the level of obtained WMPCs via human and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality WMPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating WMPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality WMPCs.

Downloads

Published

2026-03-14

How to Cite

Penzo, N., Guerini, M., Lepri, B., Glavaš, G., & Tonelli, S. (2026). Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints. Proceedings of the AAAI Conference on Artificial Intelligence, 40(39), 32701–32709. https://doi.org/10.1609/aaai.v40i39.40548

Issue

Section

AAAI Technical Track on Natural Language Processing IV