Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages

Authors

  • Upendra Kumar Indian Institute of Information Technology, Sri City, AP
  • Vishal Singh Indian Institute of Information Technology, Sri City, AP
  • Chris Andrew Indian Institute of Information Technology, Sri City, AP
  • Santhoshini Reddy Indian Institute of Information Technology, Sri City, AP
  • Amitava Das Indian Institute of Information Technology, Sri City, AP

Keywords:

Code Mixing, Deep Learning

Abstract

In this research work, we develop a state-of-art model for identifying sentiment in Hindi-English code-mixed language. We introduce new phonemic sub-word units for Hindi-English code-mixed text along with a hierarchical deep learning model which uses these sub-word units for predicting sentiment. The results indicate that the model yields a significant increase in accuracy as compared to other models.

Downloads

Published

2018-04-29

How to Cite

Kumar, U., Singh, V., Andrew, C., Reddy, S., & Das, A. (2018). Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/12193