Span Graph Transformer for Document-Level Named Entity Recognition

Authors

  • Hongli Mao Beijing Institute of Technology
  • Xian-Ling Mao Beijing Institute of Technology
  • Hanlin Tang Beijing Institute of Technology
  • Yu-Ming Shang Beijing University of Posts and Telecommunications
  • Heyan Huang Beijing Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v38i17.29841

Keywords:

NLP: Information Extraction

Abstract

Named Entity Recognition (NER), which aims to identify the span and category of entities within text, is a fundamental task in natural language processing. Recent NER approaches have featured pre-trained transformer-based models (e.g., BERT) as a crucial encoding component to achieve state-of-the-art performance. However, due to the length limit for input text, these models typically consider text at the sentence-level and cannot capture the long-range contextual dependency within a document. To address this issue, we propose a novel Span Graph Transformer (SGT) method for document-level NER, which constructs long-range contextual dependencies at both the token and span levels. Specifically, we first retrieve relevant contextual sentences in the document for each target sentence, and jointly encode them by BERT to capture token-level dependencies. Then, our proposed model extracts candidate spans from each sentence and integrates these spans into a document-level span graph, where nested spans within sentences and identical spans across sentences are connected. By leveraging the power of Graph Transformer and well-designed position encoding, our span graph can fully exploit span-level dependencies within the document. Extensive experiments on both resource-rich nested and flat NER datasets, as well as low-resource distantly supervised NER datasets, demonstrate that proposed SGT model achieves better performance than previous state-of-the-art models.

Published

2024-03-24

How to Cite

Mao, H., Mao, X.-L., Tang, H., Shang, Y.-M., & Huang, H. (2024). Span Graph Transformer for Document-Level Named Entity Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18769–18777. https://doi.org/10.1609/aaai.v38i17.29841

Issue

Section

AAAI Technical Track on Natural Language Processing II