Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Iulian Serban; Tim Klinger; Gerald Tesauro; Kartik Talamadupula; Bowen Zhou; Yoshua Bengio; Aaron Courville

doi:10.1609/aaai.v31i1.10984

Authors

Iulian Serban University of Montreal
Tim Klinger IBM
Gerald Tesauro IBM
Kartik Talamadupula IBM
Bowen Zhou IBM
Yoshua Bengio University of Montreal
Aaron Courville University of Montreal

DOI:

https://doi.org/10.1609/aaai.v31i1.10984

Keywords:

Dialogue System, Conversational System, Chatbot, Neural Network, Deep Learning, Generative Models, Variational Autoencoder, Latent Variable Model, Variational Learning, Technical Support, Ubuntu

Abstract

We introduce a new class of models called multiresolution recurrent neural networks, which explicitly model natural language generation at multiple levels of abstraction. The models extend the sequence-to-sequence framework to generate two parallel stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language words (e.g. sentences). The coarse sequences follow a latent stochastic process with a factorial representation, which helps the models generalize to new examples. The coarse sequences can also incorporate task-specific knowledge, when available. In our experiments, the coarse sequences are extracted using automatic procedures, which are designed to capture compositional structure and semantics. These procedures enable training the multiresolution recurrent neural networks by maximizing the exact joint log-likelihood over both sequences. We apply the models to dialogue response generation in the technical support domain and compare them with several competing models. The multiresolution recurrent neural networks outperform competing models by a substantial margin, achieving state-of-the-art results according to both a human evaluation study and automatic evaluation metrics. Furthermore, experiments show the proposed models generate more fluent, relevant and goal-oriented responses.

Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription