EmojiNet: An Open Service and API for Emoji Sense Discovery

Authors

  • Sanjaya Wijeratne, Wright State University
  • Lakshika Balasuriya, Wright State University
  • Amit Sheth, Wright State University
  • Derek Doran, Wright State University

Abstract

This paper presents the release of EmojiNet, the largest machine-readable emoji sense inventory that links Unicode emoji representations to their English meanings extracted from the Web. EmojiNet is a dataset consisting of: (i) 12,904 sense labels over 2,389 emoji, which were extracted from the Web and linked to machine-readable sense definitions in BabelNet; (ii) context words associated with each emoji sense, inferred for each emoji sense definition through word embedding models trained on a Google News corpus and a Twitter message corpus; and (iii) for a selected set of emoji, the most likely platform-specific emoji sense, specified in recognition of discrepancies in how emoji are presented on different platforms. The dataset is hosted as an open service with a REST API and is available at http://emojinet.knoesis.org/. The paper discusses the development of this dataset, the evaluation of its quality, and its applications, including emoji sense disambiguation and emoji sense similarity.

Published

2017-05-03

How to Cite

Wijeratne, S., Balasuriya, L., Sheth, A., & Doran, D. (2017). EmojiNet: An Open Service and API for Emoji Sense Discovery. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 437-446. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/14857