Zero-Shot Sentiment Analysis for Code-Mixed Data

Siddharth Yadav; Tanmoy Chakraborty

doi:10.1609/aaai.v35i18.17967

Authors

Siddharth Yadav IIIT-Delhi, India
Tanmoy Chakraborty IIIT-Delhi, India

DOI:

https://doi.org/10.1609/aaai.v35i18.17967

Keywords:

Sentiment Analysis, Code-mixed Text, Zero-shot Learning, Unsupervised Learning

Abstract

Code-mixing is the practice of alternating between two or more languages. Most sentiment analysis research has been monolingual, and monolingual models perform poorly on code-mixed text. We introduce methods that use multilingual and cross-lingual embeddings to transfer knowledge from monolingual text to code-mixed text for code-mixed sentiment analysis. Our methods handle code-mixed text through zero-shot learning and beat the state of the art on English-Spanish code-mixed sentiment analysis by an absolute 3% F1-score. In a zero-shot setting, we achieve an F1-score of 0.58 without a parallel corpus and 0.62 with a parallel corpus on the same benchmark, compared to 0.68 in the supervised setting. Our code is publicly available at github.com/sedflix/unsacmt.
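
To make the zero-shot transfer idea concrete, here is a minimal illustrative sketch in Python. It is not the authors' implementation: the two-dimensional word vectors, the `EMBEDDINGS` table, the nearest-centroid classifier, and the helper names `embed`, `train_centroids`, and `predict` are all hypothetical and chosen only for illustration. It assumes that words with similar meaning in English and Spanish lie close together in a shared embedding space, so a classifier fitted only on English examples can be applied directly to code-mixed input.

```python
import math

# Hypothetical hand-made "cross-lingual" vectors: English and Spanish words
# with similar meaning are placed close together. A real system would load
# pretrained, aligned embeddings instead of this toy table.
EMBEDDINGS = {
    "good": (1.0, 0.1), "bueno": (0.9, 0.2),
    "love": (0.9, 0.0), "encanta": (0.8, 0.1),
    "bad": (-1.0, 0.1), "malo": (-0.9, 0.2),
    "hate": (-0.9, 0.0), "odio": (-0.8, 0.1),
    "movie": (0.0, 1.0), "película": (0.0, 0.9),
}


def embed(sentence):
    """Average the vectors of known words; unknown words are skipped."""
    vecs = [EMBEDDINGS[w] for w in sentence.lower().split() if w in EMBEDDINGS]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(dim) / len(vecs) for dim in zip(*vecs))


def train_centroids(labeled):
    """Compute one mean sentence vector per sentiment label."""
    by_label = {}
    for text, label in labeled:
        by_label.setdefault(label, []).append(embed(text))
    return {
        label: tuple(sum(dim) / len(vecs) for dim in zip(*vecs))
        for label, vecs in by_label.items()
    }


def predict(centroids, sentence):
    """Return the label whose centroid is nearest to the sentence vector."""
    vec = embed(sentence)
    return min(centroids, key=lambda label: math.dist(vec, centroids[label]))


# Training data is monolingual English only (assumed toy examples).
ENGLISH_TRAIN = [
    ("good movie", "positive"),
    ("love movie", "positive"),
    ("bad movie", "negative"),
    ("hate movie", "negative"),
]
centroids = train_centroids(ENGLISH_TRAIN)

# Code-mixed English-Spanish inputs never seen during training.
print(predict(centroids, "me encanta this movie"))
print(predict(centroids, "odio this movie"))
```

The classifier never sees a Spanish or code-mixed example during training; any transfer comes entirely from the shared embedding space, which is the property the abstract attributes to multilingual and cross-lingual embeddings.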
