Summarization Attack via Paraphrasing (Student Abstract)

Authors

  • Jiyao Li University of Technology Sydney
  • Wei Liu University of Technology Sydney

DOI:

https://doi.org/10.1609/aaai.v37i13.26985

Keywords:

Transformer, Adveresarial Attack, Abstractive Summarization

Abstract

Many natural language processing models are perceived to be fragile on adversarial attacks. Recent work on adversarial attack has demonstrated a high success rate on sentiment analysis as well as classification models. However, attacks to summarization models have not been well studied. Summarization tasks are rarely influenced by word substitution, since advanced abstractive summary models utilize sentence level information. In this paper, we propose a paraphrasing-based attack method to attack summarization models. We first rank the sentences in the document according to their impacts to summarization. Then, we apply paraphrasing procedure to generate adversarial samples. Finally, we test our algorithm on benchmarks datasets against others methods. Our approach achieved the highest success rate and the lowest sentence substitution rate. In addition, the adversarial samples have high semantic similarity with the original sentences.

Downloads

Published

2023-09-06

How to Cite

Li, J., & Liu, W. (2023). Summarization Attack via Paraphrasing (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16250-16251. https://doi.org/10.1609/aaai.v37i13.26985