News Content Completion with Location-Aware Image Selection

Authors

  • Zhengkun Zhang Nankai University
  • Jun Wang Ludong University
  • Adam Jatowt Kyoto University
  • Zhe Sun RIKEN
  • Shao-Ping Lu Nankai University
  • Zhenglu Yang Nankai University

DOI:

https://doi.org/10.1609/aaai.v35i16.17704

Keywords:

Language Grounding & Multi-modal NLP

Abstract

News, as one of the fundamental social media types, typically contains both texts and images. Image selection, which involves choosing appropriate images according to some specified contexts, is crucial for formulating good news. However, it presents two challenges: where to place images and which images to use. The difficulties associated with this where-which problem lie in the fact that news typically contains linguistically rich text that delivers complex information and more than one image. In this paper, we propose a novel end-to-end two-stage framework to address these issues comprehensively. In the first stage, we identify key information in news by using location embeddings, which represent the local contextual information of each candidate location for image insertion. Then, in the second stage, we thoroughly examine the candidate images and select the most context-related ones to insert into each location identified in the first stage. We also introduce three insertion strategies to formulate different scenarios influencing the image selection procedure. Extensive experiments demonstrate the consistent superiority of the proposed framework in image selection.

Downloads

Published

2021-05-18

How to Cite

Zhang, Z., Wang, J., Jatowt, A., Sun, Z., Lu, S.-P., & Yang, Z. (2021). News Content Completion with Location-Aware Image Selection. Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), 14498-14505. https://doi.org/10.1609/aaai.v35i16.17704

Issue

Section

AAAI Technical Track on Speech and Natural Language Processing III