A Deep Learning Approach for Arabic Caption Generation Using Roots-Words

Authors

  • Vasu Jindal University of Texas

DOI:

https://doi.org/10.1609/aaai.v31i1.11090

Keywords:

computer vision, machine learning, deep learning, image caption

Abstract

Automatic caption generation is a key research field in the machine learning community. However, most of the current research is performed on English caption generation ignoring other languages like Arabic and Persian. In this paper, we propose a novel technique leveraging the heavy influence of root words in Arabic to automatically generate captions in Arabic. Fragments of the images are associated with root words and deep belief network pre-trained using Restricted Boltzmann Machines are used to extract words associated with image. Finally, dependency tree relations are used to generate sentence-captions by using the dependency on root words. Our approach is robust and attains BLEU-1 score of 34.8.

Downloads

Published

2017-02-12

How to Cite

Jindal, V. (2017). A Deep Learning Approach for Arabic Caption Generation Using Roots-Words. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.11090