Learning Context-Specific Word/Character Embeddings

Authors

  • Xiaoqing Zheng, Fudan University
  • Jiangtao Feng, Fudan University
  • Yi Chen, Fudan University
  • Haoyuan Peng, Fudan University
  • Wenqing Zhang, Fudan University

DOI:

https://doi.org/10.1609/aaai.v31i1.10985

Keywords:

Word embeddings, Neural network, Unsupervised Learning

Abstract

Unsupervised word representations have been shown to improve predictive generalization on a variety of NLP tasks. Most existing models, however, are better at capturing the relatedness among words than their "genuine" similarity, because the context is often represented by a sum (or an average) of the neighbors' embeddings. This simplifies the computation but ignores an important fact: the meaning of a word is determined by its context, which reflects not only the surrounding words but also the rules used to combine them (i.e., compositionality). Furthermore, much effort has been devoted to learning a single-prototype representation per word, which is problematic because many words are polysemous, and a single-prototype model cannot capture homonymy and polysemy. We present a neural network architecture that jointly learns word embeddings and context representations from large data sets. The explicitly produced context representations are then used to learn context-specific and multi-prototype word embeddings. Our embeddings were evaluated on several NLP tasks, and the experimental results show that the proposed model outperforms competing approaches and is applicable to intrinsically "character-based" languages.
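
The following is a minimal illustrative sketch, not the authors' actual architecture: it contrasts the common "context = average of neighbor embeddings" approach criticized in the abstract with a simple order-aware composition of the context, and shows how a context vector can select among multiple sense prototypes of a word. All names (embed_dim, W_left, W_right, sense_prototypes) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
embed_dim = 8
vocab = {"bank": 0, "river": 1, "money": 2, "deposit": 3, "flows": 4}
E = rng.normal(size=(len(vocab), embed_dim))  # toy word embedding matrix

def context_average(neighbor_ids):
    """Bag-of-words context: word order is ignored."""
    return E[neighbor_ids].mean(axis=0)

# A toy compositional context: separate transforms for left and right
# neighbors, so word order influences the context representation.
W_left = rng.normal(size=(embed_dim, embed_dim)) * 0.1
W_right = rng.normal(size=(embed_dim, embed_dim)) * 0.1

def context_compose(left_ids, right_ids):
    h = np.zeros(embed_dim)
    for i in left_ids:
        h = np.tanh(W_left @ (h + E[i]))
    for i in right_ids:
        h = np.tanh(W_right @ (h + E[i]))
    return h

# Multi-prototype lookup: a word keeps several sense vectors, and the
# context representation picks the most similar one.
sense_prototypes = {"bank": rng.normal(size=(2, embed_dim))}  # two senses

def select_sense(word, context_vec):
    protos = sense_prototypes[word]
    sims = protos @ context_vec / (
        np.linalg.norm(protos, axis=1) * np.linalg.norm(context_vec) + 1e-8)
    return protos[int(np.argmax(sims))]

ctx = context_compose([vocab["river"]], [vocab["flows"]])
bank_vec = select_sense("bank", ctx)
print(bank_vec.shape)  # (8,)
```

In this toy setup the context of "bank" between "river" and "flows" yields a vector that is matched against per-sense prototypes; in the paper the context representations are produced by the learned network and used to train the context-specific embeddings themselves.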

Published

2017-02-12

How to Cite

Zheng, X., Feng, J., Chen, Y., Peng, H., & Zhang, W. (2017). Learning Context-Specific Word/Character Embeddings. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10985