Generating Diversified Comments via Reader-Aware Topic Modeling and Saliency Detection

Authors

  • Wei Wang Tsinghua University
  • Piji Li Tencent AI Lab
  • Hai-Tao Zheng Tsinghua University

Keywords:

Generation

Abstract

Automatic comment generation is a special and challenging task to verify the model ability on news content comprehension and language generation. Comments not only convey salient and interesting information in news articles, but also imply various and different reader characteristics which we treat as the essential clues for diversity. However, most of the comment generation approaches only focus on saliency information extraction, while the reader-aware factors implied by comments are neglected. To address this issue, we propose a unified reader-aware topic modeling and saliency information detection framework to enhance the quality of generated comments. For reader-aware topic modeling, we design a variational generative clustering algorithm for latent semantic learning and topic mining from reader comments. For saliency information detection, we introduce Bernoulli distribution estimating on news content to select saliency information. The obtained topic representations as well as the selected saliency information are incorporated into the decoder to generate diversified and informative comments. Experimental results on three datasets show that our framework outperforms existing baseline methods in terms of both automatic metrics and human evaluation. The potential ethical issues are also discussed in detail.

Downloads

Published

2021-05-18

How to Cite

Wang, W., Li, P., & Zheng, H.-T. (2021). Generating Diversified Comments via Reader-Aware Topic Modeling and Saliency Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), 13988-13996. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17647

Issue

Section

AAAI Technical Track on Speech and Natural Language Processing III