Generating Coherent Narratives by Learning Dynamic and Discrete Entity States with a Contrastive Framework

Authors

  • Jian Guan The CoAI group, DCST Institute for Artificial Intelligence State Key Lab of Intelligent Technology and Systems Beijing National Research Center for Information Science and Technology Tsinghua University, Beijing 100084, China
  • Zhenyu Yang Guangdong OPPO Mobile Telecommunications Corp., Ltd.
  • Rongsheng Zhang Fuxi AI Lab, NetEase Inc., Hangzhou, China
  • Zhipeng Hu Fuxi AI Lab, NetEase Inc., Hangzhou, China
  • Minlie Huang The CoAI group, DCST Institute for Artificial Intelligence State Key Lab of Intelligent Technology and Systems Beijing National Research Center for Information Science and Technology Tsinghua University, Beijing 100084, China

DOI:

https://doi.org/10.1609/aaai.v37i11.26509

Keywords:

SNLP: Generation

Abstract

Despite advances in generating fluent texts, existing pretraining models tend to attach incoherent event sequences to involved entities when generating narratives such as stories and news. We conjecture that such issues result from representing entities as static embeddings of superficial words, while neglecting to model their ever-changing states, i.e., the information they carry, as the text unfolds. Therefore, we extend the Transformer model to dynamically conduct entity state updates and sentence realization for narrative generation. We propose a contrastive framework to learn the state representations in a discrete space, and insert additional attention layers into the decoder to better exploit these states. Experiments on two narrative datasets show that our model can generate more coherent and diverse narratives than strong baselines with the guidance of meaningful entity states.

Downloads

Published

2023-06-26

How to Cite

Guan, J., Yang, Z., Zhang, R., Hu, Z., & Huang, M. (2023). Generating Coherent Narratives by Learning Dynamic and Discrete Entity States with a Contrastive Framework. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 12836-12844. https://doi.org/10.1609/aaai.v37i11.26509

Issue

Section

AAAI Technical Track on Speech & Natural Language Processing