Attention Beam: An Image Captioning Approach (Student Abstract)

Anubhav Shrimal; Tanmoy Chakraborty

doi:10.1609/aaai.v35i18.17940

Authors

Anubhav Shrimal IIIT-Delhi, India
Tanmoy Chakraborty IIIT-Delhi, India

Keywords:

Image Captioning, Beam Search, Attention Network, Natural Language Generation, Computer Vision

Abstract

The aim of image captioning is to generate a textual description of a given image. Though seemingly an easy task for humans, it is challenging for machines, as it requires the ability to comprehend the image (computer vision) and then generate a human-like description of it (natural language understanding). In recent times, encoder-decoder-based architectures have achieved state-of-the-art results for image captioning. Here, we present a beam search heuristic, applied on top of the encoder-decoder architecture, that yields better-quality captions on three benchmark datasets: Flickr8k, Flickr30k and MS COCO.
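
For readers unfamiliar with beam search, the following is a minimal sketch of the standard decoding procedure that the heuristic builds on. It is not the authors' Attention Beam method, whose details are not given in this abstract. The names `beam_search`, `step_fn` and `toy_decoder` are hypothetical, and the lookup table stands in for a trained encoder-decoder that would condition on image features.

```python
import math
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[List[str], float]  # (tokens, cumulative log-probability)


def beam_search(
    step_fn: Callable[[List[str]], Dict[str, float]],
    beam_width: int = 3,
    max_len: int = 10,
    start: str = "<s>",
    end: str = "</s>",
) -> List[Hypothesis]:
    """Generic beam search; returns hypotheses sorted best-first."""
    beams: List[Hypothesis] = [([start], 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates: List[Hypothesis] = []
        for tokens, score in beams:
            for tok, prob in step_fn(tokens).items():
                if prob > 0.0:
                    candidates.append((tokens + [tok], score + math.log(prob)))
        if not candidates:
            break
        # Keep only the beam_width highest-scoring expansions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_width]:
            if tokens[-1] == end:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    finished.extend(beams)  # fall back to unfinished hypotheses, if any
    finished.sort(key=lambda c: c[1], reverse=True)
    return finished


def toy_decoder(tokens: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in for a caption decoder (assumption, not a real model)."""
    table = {
        "<s>": {"a": 0.6, "the": 0.4},
        "a": {"dog": 0.55, "cat": 0.45},
        "the": {"dog": 0.9, "cat": 0.1},
        "dog": {"</s>": 1.0},
        "cat": {"</s>": 1.0},
    }
    return table[tokens[-1]]


best_tokens, best_score = beam_search(toy_decoder, beam_width=2)[0]
greedy_tokens, greedy_score = beam_search(toy_decoder, beam_width=1)[0]
```

In this toy setting a beam of width 2 recovers a higher-probability sequence than greedy decoding (width 1), because the locally best first token leads to a weaker continuation.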

Published

2021-05-18

How to Cite

Shrimal, A., & Chakraborty, T. (2021). Attention Beam: An Image Captioning Approach (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 35(18), 15887–15888. https://doi.org/10.1609/aaai.v35i18.17940

Issue

Vol. 35 No. 18: AAAI-21 Student Papers and Demonstrations

Section

AAAI Student Abstract and Poster Program
