TextGAIL: Generative Adversarial Imitation Learning for Text Generation

Qingyang Wu; Lei Li; Zhou Yu

doi:10.1609/aaai.v35i16.17656

Authors

Qingyang Wu, University of California, Davis
Lei Li, ByteDance AI Lab
Zhou Yu, University of California, Davis

DOI:

https://doi.org/10.1609/aaai.v35i16.17656

Keywords:

Generation

Abstract

Generative Adversarial Networks (GANs) for text generation have recently drawn considerable criticism because they perform worse than their maximum likelihood estimation (MLE) counterparts. We suspect that the inferior performance of previous text GANs stems from the lack of a reliable guiding signal in their discriminators. To address this problem, we propose a generative adversarial imitation learning framework for text generation that uses large pre-trained language models to provide more reliable reward guidance. Because previous text GANs suffer from high gradient variance, we apply a contrastive discriminator and proximal policy optimization (PPO) to stabilize and improve text generation performance. For evaluation, we conduct experiments on a diverse set of unconditional and conditional text generation tasks. Experimental results show that TextGAIL outperforms the MLE baseline in both quality and diversity. Using an additional task, we also validate our intuition that TextGAIL's discriminator is capable of providing reasonable rewards.
