Spectral Word Embedding with Negative Sampling

Behrouz Haji Soleimani; Stan Matwin

doi:10.1609/aaai.v32i1.12015

Authors

Behrouz Haji Soleimani Dalhousie University
Stan Matwin Dalhousie University

DOI:

https://doi.org/10.1609/aaai.v32i1.12015

Keywords:

Word Embedding, Natural Language Processing, Unsupervised Learning, Matrix Factorization, Spectral Algorithms, Singular Value Decomposition

Abstract

In this work, we investigate word embedding algorithms in the context of natural language processing. In particular, we examine the notion of ``negative examples'', the unobserved or insignificant word-context co-occurrences, in spectral methods. we provide a new formulation for the word embedding problem by proposing a new intuitive objective function that perfectly justifies the use of negative examples. In fact, our algorithm not only learns from the important word-context co-occurrences, but also it learns from the abundance of unobserved or insignificant co-occurrences to improve the distribution of words in the latent embedded space. We analyze the algorithm theoretically and provide an optimal solution for the problem using spectral analysis. We have trained various word embedding algorithms on articles of Wikipedia with 2.1 billion tokens and show that negative sampling can boost the quality of spectral methods. Our algorithm provides results as good as the state-of-the-art but in a much faster and efficient way.

Spectral Word Embedding with Negative Sampling

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information