Zera-Shot Sentiment Analysis for Code-Mixed Data

Authors

  • Siddharth Yadav IIIT-Delhi, India
  • Tanmoy Chakraborty IIIT-Delhi, India

Keywords:

Sentiment Analysis, Code-mixed Text, Zero-shot Learning, Unsupervised Learning

Abstract

Code-mixing is the practice of alternating between two or more languages. A major part of sentiment analysis research has been monolingual and they perform poorly on the code-mixed text. We introduce methods that use multilingual and cross-lingual embeddings to transfer knowledge from monolingual text to code-mixed text for code-mixed sentiment analysis. Our methods handle code-mixed text through zero-shot learning and beat state-of-the-art English-Spanish code-mixed sentiment analysis by an absolute 3% F1-score. We are able to achieve 0.58 F1-score (without a parallel corpus) and 0.62 F1-score (with the parallel corpus) on the same benchmark in a zero-shot way as compared to 0.68 F1-score in supervised settings. Our code is publicly available on github.com/sedflix/unsacmt.

Downloads

Published

2021-05-18

How to Cite

Yadav, S., & Chakraborty, T. (2021). Zera-Shot Sentiment Analysis for Code-Mixed Data. Proceedings of the AAAI Conference on Artificial Intelligence, 35(18), 15941-15942. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17967

Issue

Section

AAAI Student Abstract and Poster Program