Doc2Dial: A Framework for Dialogue Composition Grounded in Documents

Authors

  • Song Feng IBM Thomas J. Watson Research Center
  • Kshitij Fadnis IBM Thomas J. Watson Research Center
  • Q. Vera Liao IBM Thomas J. Watson Research Center
  • Luis A. Lastras IBM Thomas J. Watson Research Center

DOI:

https://doi.org/10.1609/aaai.v34i09.7089

Abstract

We introduce Doc2Dial, an end-to-end framework for generating conversational data grounded in given documents. It takes the documents as input and generates the pipelined tasks for obtaining the annotations specifically for producing the simulated dialog flows. Then, the dialog flows are used to guide the collection of the utterances via the integrated crowdsourcing tool. The outcomes include the human-human dialogue data grounded in the given documents, as well as various types of automatically or human labeled annotations that help ensure the quality of the dialog data with the flexibility to (re)composite dialogues. We expect such data can facilitate building automated dialogue agents for goal-oriented tasks. We demonstrate Doc2Dial system with the various domain documents for customer care.

Downloads

Published

2020-04-03

How to Cite

Feng, S., Fadnis, K., Liao, Q. V., & Lastras, L. A. (2020). Doc2Dial: A Framework for Dialogue Composition Grounded in Documents. Proceedings of the AAAI Conference on Artificial Intelligence, 34(09), 13604-13605. https://doi.org/10.1609/aaai.v34i09.7089