Improving Word Embeddings with Convolutional Feature Learning and Subword Information

Shaosheng Cao; Wei Lu

doi:10.1609/aaai.v31i1.10993

Improving Word Embeddings with Convolutional Feature Learning and Subword Information

Authors

Shaosheng Cao Singapore University of Technology and Design
Wei Lu Singapore University of Technology and Design

DOI:

https://doi.org/10.1609/aaai.v31i1.10993

Abstract

We present a novel approach to learning word embeddings by exploring subword information (character n-gram, root/affix and inflections) and capturing the structural information of their context with convolutional feature learning. Specifically, we introduce a convolutional neural network architecture that allows us to measure structural information of context words and incorporate subword features conveying semantic, syntactic and morphological information related to the words. To assess the effectiveness of our model, we conduct extensive experiments on the standard word similarity and word analogy tasks. We showed improvements over existing state-of-the-art methods for learning word embeddings, including skipgram, GloVe, char n-gram and DSSM.

Downloads

Published

2017-02-12

How to Cite

Cao, S., & Lu, W. (2017). Improving Word Embeddings with Convolutional Feature Learning and Subword Information. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10993

Download Citation

Issue

Vol. 31 No. 1 (2017): Thirty-First AAAI Conference on Artificial Intelligence

Section

Main Track: NLP and Machine Learning