Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization

Authors

  • Piji Li The Chinese University of Hong Kong
  • Zihao Wang The Chinese University of Hong Kong
  • Wai Lam The Chinese University of Hong Kong
  • Zhaochun Ren University College London
  • Lidong Bing AI Platform Department, Tencent Inc.

DOI:

https://doi.org/10.1609/aaai.v31i1.11007

Keywords:

Multi-Document Summarization, Variational Auto-Encoders

Abstract

We propose a new unsupervised sentence salience framework for Multi-Document Summarization (MDS), which can be divided into two components: latent semantic modeling and salience estimation. For latent semantic modeling, a neural generative model called Variational Auto-Encoders (VAEs) is employed to describe the observed sentences and the corresponding latent semantic representations. Neural variational inference is used for the posterior inference of the latent variables. For salience estimation, we propose an unsupervised data reconstruction framework, which jointly considers the reconstruction for latent semantic space and observed term vector space. Therefore, we can capture the salience of sentences from these two different and complementary vector spaces. Thereafter, the VAEs-based latent semantic model is integrated into the sentence salience estimation component in a unified fashion, and the whole framework can be trained jointly by back-propagation via multi-task learning. Experimental results on the benchmark datasets DUC and TAC show that our framework achieves better performance than the state-of-the-art models.

Downloads

Published

2017-02-12

How to Cite

Li, P., Wang, Z., Lam, W., Ren, Z., & Bing, L. (2017). Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.11007