Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Authors

  • Iulian Serban, University of Montreal
  • Tim Klinger, IBM
  • Gerald Tesauro, IBM
  • Kartik Talamadupula, IBM
  • Bowen Zhou, IBM
  • Yoshua Bengio, University of Montreal
  • Aaron Courville, University of Montreal

DOI:

https://doi.org/10.1609/aaai.v31i1.10984

Keywords:

Dialogue System, Conversational System, Chatbot, Neural Network, Deep Learning, Generative Models, Variational Autoencoder, Latent Variable Model, Variational Learning, Technical Support, Ubuntu

Abstract

We introduce a new class of models called multiresolution recurrent neural networks, which explicitly model natural language generation at multiple levels of abstraction. The models extend the sequence-to-sequence framework to generate two parallel stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language words (e.g. sentences). The coarse sequences follow a latent stochastic process with a factorial representation, which helps the models generalize to new examples. The coarse sequences can also incorporate task-specific knowledge, when available. In our experiments, the coarse sequences are extracted using automatic procedures, which are designed to capture compositional structure and semantics. These procedures enable training the multiresolution recurrent neural networks by maximizing the exact joint log-likelihood over both sequences. We apply the models to dialogue response generation in the technical support domain and compare them with several competing models. The multiresolution recurrent neural networks outperform competing models by a substantial margin, achieving state-of-the-art results according to both a human evaluation study and automatic evaluation metrics. Furthermore, experiments show the proposed models generate more fluent, relevant and goal-oriented responses.
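
As a rough illustration of what "maximizing the exact joint log-likelihood over both sequences" could look like, the sketch below writes out one plausible objective for a dialogue of N utterances. The symbols (w_n for the words of utterance n, z_n for its coarse tokens, θ for the model parameters) and the particular conditioning structure are assumptions made for illustration; the abstract itself does not state the factorization.

```latex
\log P_\theta(w_1,\dots,w_N,\; z_1,\dots,z_N)
  = \sum_{n=1}^{N} \Big[ \log P_\theta(z_n \mid z_{<n})
  + \log P_\theta(w_n \mid w_{<n},\, z_{\le n}) \Big]
```

Under this reading, because the coarse sequences are extracted automatically from the data, both terms would be observed at training time, so the sum could be evaluated directly without marginalizing over the coarse tokens.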

Published

2017-02-12

How to Cite

Serban, I., Klinger, T., Tesauro, G., Talamadupula, K., Zhou, B., Bengio, Y., & Courville, A. (2017). Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10984