A Deep Generative Framework for Paraphrase Generation

Ankush Gupta; Arvind Agarwal; Prawaan Singh; Piyush Rai

doi:10.1609/aaai.v32i1.11956

Authors

Ankush Gupta, IBM Research India
Arvind Agarwal, IBM Research India
Prawaan Singh, Indian Institute of Technology, Kanpur
Piyush Rai, Indian Institute of Technology, Kanpur

DOI:

https://doi.org/10.1609/aaai.v32i1.11956

Keywords:

Paraphrase generation, variational autoencoders, question paraphrase

Abstract

Paraphrase generation is an important problem in NLP, with applications in question answering, information retrieval, information extraction, and conversation systems, to name a few. In this paper, we address the problem of generating paraphrases automatically. Our proposed method combines deep generative models (VAE) with sequence-to-sequence models (LSTM) to generate paraphrases of a given input sentence. Traditional VAEs, when combined with recurrent neural networks, can generate free text, but they are not suitable for generating paraphrases of a given sentence. We address this problem by conditioning both the encoder and the decoder sides of the VAE on the original sentence, so that the model can generate paraphrases of that sentence. Unlike most existing models, our model is simple and modular, and it can generate multiple paraphrases for a given sentence. Quantitative evaluation of the proposed method on a benchmark paraphrase dataset demonstrates its efficacy and shows that it outperforms the state-of-the-art methods by a significant margin, whereas qualitative human evaluation indicates that the generated paraphrases are well-formed, grammatically correct, and relevant to the input sentence. Furthermore, we evaluate our method on a newly released question paraphrase dataset and establish a new baseline for future research.
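
The conditioning described in the abstract can be illustrated with the usual conditional VAE training objective. The following is a sketch under stated assumptions, not necessarily the exact formulation used in the paper: x^(o) denotes the original sentence, x^(p) its paraphrase, z the latent code, q_phi the (LSTM) encoder, p_theta the (LSTM) decoder, and p(z) a standard Gaussian prior. All of these symbols are introduced here for illustration only.

```latex
\mathcal{L}(\theta, \phi;\, x^{(p)}, x^{(o)})
  = \mathbb{E}_{q_\phi(z \mid x^{(o)},\, x^{(p)})}
      \left[ \log p_\theta\!\left(x^{(p)} \mid z,\, x^{(o)}\right) \right]
  - \mathrm{KL}\!\left( q_\phi\!\left(z \mid x^{(o)}, x^{(p)}\right) \,\big\|\, p(z) \right)
```

Under this reading, both terms depend on the original sentence x^(o), which is what distinguishes the setup from an unconditional VAE over free text. At generation time, drawing several samples of z from p(z) and decoding each one conditioned on x^(o) would yield several candidate paraphrases of the same input.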
