Summarization Attack via Paraphrasing (Student Abstract)


  • Jiyao Li University of Technology Sydney
  • Wei Liu University of Technology Sydney



Transformer, Adveresarial Attack, Abstractive Summarization


Many natural language processing models are perceived to be fragile on adversarial attacks. Recent work on adversarial attack has demonstrated a high success rate on sentiment analysis as well as classification models. However, attacks to summarization models have not been well studied. Summarization tasks are rarely influenced by word substitution, since advanced abstractive summary models utilize sentence level information. In this paper, we propose a paraphrasing-based attack method to attack summarization models. We first rank the sentences in the document according to their impacts to summarization. Then, we apply paraphrasing procedure to generate adversarial samples. Finally, we test our algorithm on benchmarks datasets against others methods. Our approach achieved the highest success rate and the lowest sentence substitution rate. In addition, the adversarial samples have high semantic similarity with the original sentences.




How to Cite

Li, J., & Liu, W. (2023). Summarization Attack via Paraphrasing (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16250-16251.