Concepts Not Alone: Exploring Pairwise Relationships for Zero-Shot Video Activity Recognition

Authors

  • Chuang Gan Tsinghua University
  • Ming Lin University of Michigan, Ann Arbor
  • Yi Yang University of Technology Sydney
  • Gerard Melo Tsinghua University
  • Alexander G. Hauptmann Carnegie Mellon University

DOI:

https://doi.org/10.1609/aaai.v30i1.10466

Keywords:

zero-shot, video

Abstract

Vast quantities of videos are now being captured at astonishing rates, but the majority of these are not labelled. To cope with such data, we consider the task of content-based activity recognition in videos without any manually labelled examples, also known as zero-shot video recognition. To achieve this, videos are represented in terms of detected visual concepts, which are then scored as relevant or irrelevant according to their similarity with a given textual query. In this paper, we propose a more robust approach for scoring concepts in order to alleviate many of the brittleness and low precision problems of previous work. Not only do we jointly consider semantic relatedness, visual reliability, and discriminative power. To handle noise and non-linearities in the ranking scores of the selected concepts, we propose a novel pairwise order matrix approach for score aggregation. Extensive experiments on the large-scale TRECVID Multimedia Event Detection data show the superiority of our approach.

Downloads

Published

2016-03-05

How to Cite

Gan, C., Lin, M., Yang, Y., Melo, G., & G. Hauptmann, A. (2016). Concepts Not Alone: Exploring Pairwise Relationships for Zero-Shot Video Activity Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10466