Adversarial Learning for Chinese NER From Crowd Annotations

Authors

  • YaoSheng Yang Soochow University
  • Meishan Zhang Heilongjiang University
  • Wenliang Chen Soochow University
  • Wei Zhang Alibaba Group
  • Haofen Wang Shenzhen Gowild Robotics Co. Ltd
  • Min Zhang Soochow University

Keywords:

Named Entity Recognition, Crowd Annotations

Abstract

To quickly obtain new labeled data, we can choose crowdsourcing as an alternative way at lower cost in a short time. But as an exchange, crowd annotations from non-experts may be of lower quality than those from experts. In this paper, we propose an approach to performing crowd annotation learning for Chinese Named Entity Recognition (NER) to make full use of the noisy sequence labels from multiple annotators. Inspired by adversarial learning, our approach uses a common Bi-LSTM and a private Bi-LSTM for representing annotator-generic and -specific information. The annotator-generic information is the common knowledge for entities easily mastered by the crowd. Finally, we build our Chinese NE tagger based on the LSTM-CRF model. In our experiments, we create two data sets for Chinese NER tasks from two domains. The experimental results show that our system achieves better scores than strong baseline systems.

Downloads

Published

2018-04-25

How to Cite

Yang, Y., Zhang, M., Chen, W., Zhang, W., Wang, H., & Zhang, M. (2018). Adversarial Learning for Chinese NER From Crowd Annotations. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/11507

Issue

Section

AAAI Technical Track: Human-Computation and Crowd Sourcing