Neural Networks Incorporating Dictionaries for Chinese Word Segmentation

Authors

  • Qi Zhang Fudan University
  • Xiaoyu Liu Fudan University
  • Jinlan Fu Fudan University

DOI:

https://doi.org/10.1609/aaai.v32i1.11959

Keywords:

Chinese word segmentation, Deep Learning

Abstract

In recent years, deep neural networks have achieved significant success in Chinese word segmentation and many other natural language processing tasks. Most of these algorithms are end-to-end trainable systems and can effectively process and learn from large scale labeled datasets. However, these methods typically lack the capability of processing rare words and data whose domains are different from training data. Previous statistical methods have demonstrated that human knowledge can provide valuable information for handling rare cases and domain shifting problems. In this paper, we seek to address the problem of incorporating dictionaries into neural networks for the Chinese word segmentation task. Two different methods that extend the bi-directional long short-term memory neural network are proposed to perform the task. To evaluate the performance of the proposed methods, state-of-the-art supervised models based methods and domain adaptation approaches are compared with our methods on nine datasets from different domains. The experimental results demonstrate that the proposed methods can achieve better performance than other state-of-the-art neural network methods and domain adaptation approaches in most cases.

Downloads

Published

2018-04-27

How to Cite

Zhang, Q., Liu, X., & Fu, J. (2018). Neural Networks Incorporating Dictionaries for Chinese Word Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.11959