Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages

Authors

  • Upendra Kumar Indian Institute of Information Technology, Sri City, AP
  • Vishal Singh Indian Institute of Information Technology, Sri City, AP
  • Chris Andrew Indian Institute of Information Technology, Sri City, AP
  • Santhoshini Reddy Indian Institute of Information Technology, Sri City, AP
  • Amitava Das Indian Institute of Information Technology, Sri City, AP

DOI:

https://doi.org/10.1609/aaai.v32i1.12193

Keywords:

Code Mixing, Deep Learning

Abstract

In this research work, we develop a state-of-art model for identifying sentiment in Hindi-English code-mixed language. We introduce new phonemic sub-word units for Hindi-English code-mixed text along with a hierarchical deep learning model which uses these sub-word units for predicting sentiment. The results indicate that the model yields a significant increase in accuracy as compared to other models.

Downloads

Published

2018-04-29

How to Cite

Kumar, U., Singh, V., Andrew, C., Reddy, S., & Das, A. (2018). Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12193