Modeling Dialogues with Hashcode Representations: A Nonparametric Approach

Sahil Garg; Irina Rish; Guillermo Cecchi; Palash Goyal; Sarik Ghazarian; Shuyang Gao; Greg Ver Steeg; Aram Galstyan

doi:10.1609/aaai.v34i04.5813

Authors

Sahil Garg USC ISI
Irina Rish U of Montreal
Guillermo Cecchi IBM Research
Palash Goyal USC ISI
Sarik Ghazarian USC ISI
Shuyang Gao USC ISI
Greg Ver Steeg USC ISI
Aram Galstyan USC ISI

DOI:

https://doi.org/10.1609/aaai.v34i04.5813

Abstract

We propose a novel dialogue modeling framework, the first-ever nonparametric kernel functions based approach for dialogue modeling, which learns hashcodes as text representations; unlike traditional deep learning models, it handles well relatively small datasets, while also scaling to large ones. We also derive a novel lower bound on mutual information, used as a model-selection criterion favoring representations with better alignment between the utterances of participants in a collaborative dialogue setting, as well as higher predictability of the generated responses. As demonstrated on three real-life datasets, including prominently psychotherapy sessions, the proposed approach significantly outperforms several state-of-art neural network based dialogue systems, both in terms of computational efficiency, reducing training time from days or weeks to hours, and the response quality, achieving an order of magnitude improvement over competitors in frequency of being chosen as the best model by human evaluators.

Modeling Dialogues with Hashcode Representations: A Nonparametric Approach

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription