Joint Word Representation Learning Using a Corpus and a Semantic Lexicon

Danushka Bollegala; Mohammed Alsuhaibani; Takanori Maehara; Ken-ichi Kawarabayashi

doi:10.1609/aaai.v30i1.10340

Authors

Danushka Bollegala The University of Liverpool
Mohammed Alsuhaibani The University of Liverpool
Takanori Maehara Shizuoka University
Ken-ichi Kawarabayashi National Institute of Informatics

DOI:

https://doi.org/10.1609/aaai.v30i1.10340

Keywords:

Deep Learning, Word Representations

Abstract

Methods for learning word representations using large text corpora have received much attention lately due to their impressive performancein numerous natural language processing (NLP) tasks such as, semantic similarity measurement, and word analogy detection.Despite their success, these data-driven word representation learning methods do not considerthe rich semantic relational structure between words in a co-occurring context. On the other hand, already much manual effort has gone into the construction of semantic lexicons such as the WordNetthat represent the meanings of words by defining the various relationships that exist among the words in a language.We consider the question, can we improve the word representations learnt using a corpora by integrating theknowledge from semantic lexicons?. For this purpose, we propose a joint word representation learning method that simultaneously predictsthe co-occurrences of two words in a sentence subject to the relational constrains given by the semantic lexicon.We use relations that exist between words in the lexicon to regularize the word representations learnt from the corpus.Our proposed method statistically significantly outperforms previously proposed methods for incorporating semantic lexicons into wordrepresentations on several benchmark datasets for semantic similarity and word analogy.

Joint Word Representation Learning Using a Corpus and a Semantic Lexicon

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription