Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization

Abhishek Singh; Manish Gupta; Vasudeva Varma

doi:10.1609/aaai.v32i1.11994

Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization

Authors

Abhishek Singh IIIT Hyderabad
Manish Gupta IIIT Hyderabad & Microsoft
Vasudeva Varma IIIT Hyderabad

DOI:

https://doi.org/10.1609/aaai.v32i1.11994

Keywords:

Natural Language, Summarization, Deep Learning

Abstract

Automated multi-document extractive text summarization is a widely studied research problem in the field of natural language understanding. Such extractive mechanisms compute in some form the worthiness of a sentence to be included into the summary. While the conventional approaches rely on human crafted document-independent features to generate a summary, we develop a data-driven novel summary system called HNet, which exploits the various semantic and compositional aspects latent in a sentence to capture document independent features. The network learns sentence representation in a way that, salient sentences are closer in the vector space than non-salient sentences. This semantic and compositional feature vector is then concatenated with the document-dependent features for sentence ranking. Experiments on the DUC benchmark datasets (DUC-2001, DUC-2002 and DUC-2004) indicate that our model shows significant performance gain of around 1.5-2 points in terms of ROUGE score compared with the state-of-the-art baselines.

Downloads

Published

2018-04-27

How to Cite

Singh, A., Gupta, M., & Varma, V. (2018). Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11994

Download Citation

Issue

Vol. 32 No. 1 (2018): Thirty-Second AAAI Conference on Artificial Intelligence

Section

Main Track: NLP and Machine Learning

Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information