Dialogue Rewriting via Skeleton-Guided Generation

Authors

  • Chunlei Xin Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Hongyu Lin Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences
  • Shan Wu Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Xianpei Han Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences
  • Bo Chen Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences School of Information Engineering, Minzu University of China National Language Resources Monitoring and Research Center for Minority Languages
  • Wen Dai Xiaomi AI Lab, Xiaomi Inc.
  • Shuai Chen Xiaomi AI Lab, Xiaomi Inc.
  • Bin Wang Xiaomi AI Lab, Xiaomi Inc.
  • Le Sun Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v37i11.26619

Keywords:

SNLP: Conversational AI/Dialogue Systems

Abstract

Dialogue rewriting aims to transform multi-turn, context-dependent dialogues into well-formed, context-independent text for most NLP systems. Previous dialogue rewriting benchmarks and systems assume a fluent and informative utterance to rewrite. Unfortunately, dialogue utterances from real-world systems are frequently noisy and with various kinds of errors that can make them almost uninformative. In this paper, we first present Real-world Dialogue Rewriting Corpus (RealDia), a new benchmark to evaluate how well current dialogue rewriting systems can deal with real-world noisy and uninformative dialogue utterances. RealDia contains annotated multi-turn dialogues from real scenes with ASR errors, spelling errors, redundancies and other noises that are ignored by previous dialogue rewriting benchmarks. We show that previous dialogue rewriting approaches are neither effective nor data-efficient to resolve RealDia. Then this paper presents Skeleton-Guided Rewriter (SGR), which can resolve the task of dialogue rewriting via a skeleton-guided generation paradigm. Experiments show that RealDia is a much more challenging benchmark for real-world dialogue rewriting, and SGR can effectively resolve the task and outperform previous approaches by a large margin.

Downloads

Published

2023-06-26

How to Cite

Xin, C., Lin, H., Wu, S., Han, X., Chen, B., Dai, W., Chen, S., Wang, B., & Sun, L. (2023). Dialogue Rewriting via Skeleton-Guided Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13825-13833. https://doi.org/10.1609/aaai.v37i11.26619

Issue

Section

AAAI Technical Track on Speech & Natural Language Processing