Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

Yicheng Zou; Jun Lin; Lujun Zhao; Yangyang Kang; Zhuoren Jiang; Changlong Sun; Qi Zhang; Xuanjing Huang; Xiaozhong Liu

doi:10.1609/aaai.v35i16.17724

Authors

Yicheng Zou Fudan University
Jun Lin Alibaba Group
Lujun Zhao Alibaba Group
Yangyang Kang Alibaba Group
Zhuoren Jiang Zhejiang University
Changlong Sun Alibaba Group Zhejiang University
Qi Zhang Fudan University
Xuanjing Huang Fudan University
Xiaozhong Liu Indiana University Bloomington

DOI:

https://doi.org/10.1609/aaai.v35i16.17724

Keywords:

Summarization, Applications

Abstract

Automatic chat summarization can help people quickly grasp important information from numerous chat messages. Unlike conventional documents, chat logs usually have fragmented and evolving topics. In addition, these logs contain a quantity of elliptical and interrogative sentences, which make the chat summarization highly context dependent. In this work, we propose a novel unsupervised framework called RankAE to perform chat summarization without employing manually labeled data. RankAE consists of a topic-oriented ranking strategy that selects topic utterances according to centrality and diversity simultaneously, as well as a denoising auto-encoder that is carefully designed to generate succinct but context-informative summaries based on the selected utterances. To evaluate the proposed method, we collect a large-scale dataset of chat logs from a customer service environment and build an annotated set only for model evaluation. Experimental results show that RankAE significantly outperforms other unsupervised methods and is able to generate high-quality summaries in terms of relevance and topic coverage.

Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription