Distinguish Polarity in Bag-of-Words Visualization

Authors

  • Yusheng Xie Baidu Research
  • Zhengzhang Chen Northwestern University
  • Ankit Agrawal Northwestern University
  • Alok Choudhary Northwestern University

DOI:

https://doi.org/10.1609/aaai.v31i1.10963

Keywords:

t-SNE, sentiment, word embedding, auto encoder

Abstract

Neural network-based BOW models reveal that word-embedding vectors encode strong semantic regularities. However, such models are insensitive to word polarity. We show that, coupled with simple information such as word spellings, word-embedding vectors can preserve both semantic regularity and conceptual polarity without supervision. We then describe a nontrivial modification to the t-distributed stochastic neighbor embedding (t-SNE) algorithm that visualizes these semantic- and polarity-preserving vectors in reduced dimensions. On a real Facebook corpus, our experiments show significant improvement in t-SNE visualization as a result of the proposed modification.

Downloads

Published

2017-02-12

How to Cite

Xie, Y., Chen, Z., Agrawal, A., & Choudhary, A. (2017). Distinguish Polarity in Bag-of-Words Visualization. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10963