Conversations in the Crowd: Collecting Data for Task-Oriented Dialog Learning

Authors

  • Walter Lasecki University of Rochester
  • Ece Kamar Microsoft Research
  • Dan Bohus Microsoft Research

DOI:

https://doi.org/10.1609/hcomp.v1i1.13092

Keywords:

conversational data, dialog data, data collection, crowdsourcing

Abstract

A major challenge in developing dialog systems is obtaining realistic data to train the systems for specific domains. We study the opportunity for using crowdsourcing methods to collect dialog datasets. Specifically, we introduce ChatCollect, a system that allows researchers to collect conversations focused around definable tasks from pairs of workers in the crowd. We demonstrate that varied and in-depth dialogs can be collected using this system, then discuss ongoing work on creating a crowd-powered system for parsing semantic frames. We then discuss research opportunities in using this approach to train and improve automated dialog systems in the future.

Downloads

Published

2013-11-03

How to Cite

Lasecki, W., Kamar, E., & Bohus, D. (2013). Conversations in the Crowd: Collecting Data for Task-Oriented Dialog Learning. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 1(1), 2-5. https://doi.org/10.1609/hcomp.v1i1.13092

Issue

Section

Scaling Speech, Language Understanding and Dialogue through Crowdsourcing Workshop