Web-Based Semantic Fragment Discovery for On-Line Lingual-Visual Similarity

Authors

  • Xiaoshuai Sun The University of Queensland and Harbin Institute of Technology
  • Jiewei Cao The University of Queensland
  • Chao Li The University of Queensland
  • Lei Zhu The University of Queensland
  • Heng Tao Shen The University of Queensland and University of Electronic Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v31i1.10490

Keywords:

Web Knowledge Mining, Lingual-Visual Analysis, Automatic Multimedia Insertion and Ordering

Abstract

In this paper, we present an automatic approach for on-line discovery of visual-lingual semantic fragments from weakly labeled Internet images. Instead of learning region-entity correspondences from well-labeled image-sentence pairs, our approach directly collects and enhances the weakly labeled visual contents from the Web and constructs an adaptive visual representation which automatically links generic lingual phrases to their related visual contents. To ensure reliable and efficient semantic discovery, we adopt non-parametric density estimation to re-rank the related visual instances and proposed a fast self-similarity-based quality assessment method to identify the high-quality semantic fragments. The discovered semantic fragments provide an adaptive joint representation for texts and images, based on which lingual-visual similarity can be defined for further co-analysis of heterogeneous multimedia data. Experimental results on semantic fragment quality assessment, sentence-based image retrieval, automatic multimedia insertion and ordering demonstrated the effectiveness of the proposed framework.The experiments show that the proposed methods can make effective use of the Web knowledge, and are able to generate competitive results compared to state-of-the-art approaches in various tasks.

Downloads

Published

2017-02-10

How to Cite

Sun, X., Cao, J., Li, C., Zhu, L., & Shen, H. T. (2017). Web-Based Semantic Fragment Discovery for On-Line Lingual-Visual Similarity. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10490