Generating Image Captions in Arabic Using Root-Word Based Recurrent Neural Networks and Deep Neural Networks

Vasu Jindal

doi:10.1609/aaai.v32i1.12179

Generating Image Captions in Arabic Using Root-Word Based Recurrent Neural Networks and Deep Neural Networks

Authors

Vasu Jindal University of Texas at Dallas

DOI:

https://doi.org/10.1609/aaai.v32i1.12179

Keywords:

deep learning, computer vision, machine learning

Abstract

Automatic caption generation of an image requires both computer vision and natural language processing techniques. Despite of advanced research in English caption generation, research on generating Arabic descriptions of an image is extremely limited. Semitic languages like Arabic are heavily influenced by root-words. We leverage this critical dependency of Arabic and in this paper are the first to generate captions of an image directly in Arabic using root-word based Recurrent Neural Networks and Deep Neural Networks. We report the first BLEU score for direct Arabic caption generation. Experimental results confirm that generating image captions using root-words directly in Arabic significantly outperforms the English-Arabic translated captions using state-of-the-art methods.

Downloads

Published

2018-04-29

How to Cite

Jindal, V. (2018). Generating Image Captions in Arabic Using Root-Word Based Recurrent Neural Networks and Deep Neural Networks. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12179

Download Citation

Issue

Vol. 32 No. 1 (2018): Thirty-Second AAAI Conference on Artificial Intelligence

Section

Student Abstract Track

Generating Image Captions in Arabic Using Root-Word Based Recurrent Neural Networks and Deep Neural Networks

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information