Dynamic Modeling Cross- and Self-Lattice Attention Network for Chinese NER

Authors

  • Shan Zhao, College of Computer, National University of Defense Technology
  • Minghao Hu, Information Research Center of Military Science, PLA Academy of Military Science
  • Zhiping Cai, College of Computer, National University of Defense Technology
  • Haiwen Chen, National University of Defense Technology
  • Fang Liu, School of Design, Hunan University

Keywords

Information Extraction

Abstract

Word-character lattice models have proven effective for Chinese named entity recognition (NER): word boundary information is fused into character sequences to enhance character representations. However, prior approaches have used only simple methods such as feature concatenation or position encoding to integrate word-character lattice information, and fail to capture fine-grained correlations in word-character spaces. In this paper, we propose DCSAN, a Dynamic Cross- and Self-lattice Attention Network that models dense interactions over the word-character lattice structure for Chinese NER. By carefully combining cross-lattice and self-lattice attention modules with a gated word-character semantic fusion unit, the network can explicitly capture fine-grained correlations across different spaces (e.g., word-to-character and character-to-character), thus significantly improving model performance. Experiments on four Chinese NER datasets show that DCSAN obtains state-of-the-art results as well as competitive efficiency compared to several strong approaches.
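The abstract describes three components: cross-lattice attention (characters attending over matched lattice words), a gated word-character fusion unit, and self-lattice attention over the fused characters. The paper's exact parameterization is not given in the abstract, so the following is a minimal NumPy sketch under assumed conventions (scaled dot-product attention, a sigmoid gate over the concatenated character and word-context vectors); all weight matrices and dimensions here are illustrative, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

d = 8                      # hidden size (illustrative)
n_char, n_word = 5, 3      # sequence lengths (illustrative)
C = rng.normal(size=(n_char, d))   # character representations
W = rng.normal(size=(n_word, d))   # matched-word (lattice) representations

# Cross-lattice attention: each character attends over candidate words
# (scaled dot-product, an assumed instantiation).
A = softmax(C @ W.T / np.sqrt(d), axis=-1)   # (n_char, n_word)
word_ctx = A @ W                             # word-aware character context

# Gated word-character semantic fusion (hypothetical parameterization):
# a sigmoid gate interpolates between the character and its word context.
Wg = rng.normal(size=(2 * d, d))
g = 1.0 / (1.0 + np.exp(-np.concatenate([C, word_ctx], axis=-1) @ Wg))
fused = g * C + (1.0 - g) * word_ctx         # (n_char, d)

# Self-lattice attention: fused characters attend over each other,
# capturing character-to-character correlations.
out = softmax(fused @ fused.T / np.sqrt(d), axis=-1) @ fused
```

In a full model, `out` would feed a CRF or softmax tagging layer; the sketch only shows how the two attention stages and the gate compose.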

Published

2021-05-18

How to Cite

Zhao, S., Hu, M., Cai, Z., Chen, H., & Liu, F. (2021). Dynamic Modeling Cross- and Self-Lattice Attention Network for Chinese NER. Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), 14515-14523. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17706

Section

AAAI Technical Track on Speech and Natural Language Processing III