Face Video Retrieval via Deep Learning of Binary Hash Representations

Authors

  • Zhen Dong Beijing Institute of Technology
  • Su Jia Stony Brook University
  • Tianfu Wu Beijing University of Posts and Telecommunications and University of California, Los Angeles
  • Mingtao Pei Beijing Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v30i1.10445

Keywords:

face video retrieval, convolutional neural network

Abstract

Retrieving faces from large mess of videos is an attractive research topic with wide range of applications. Its challenging problems are large intra-class variations, and tremendous time and space complexity. In this paper, we develop a new deep convolutional neural network (deep CNN) to learn discriminative and compact binary representations of faces for face video retrieval. The network integrates feature extraction and hash learning into a unified optimization framework for the optimal compatibility of feature extractor and hash functions. In order to better initialize the network, the low-rank discriminative binary hashing is proposed to pre-learn hash functions during the training procedure. Our method achieves excellent performances on two challenging TV-Series datasets.

Downloads

Published

2016-03-05

How to Cite

Dong, Z., Jia, S., Wu, T., & Pei, M. (2016). Face Video Retrieval via Deep Learning of Binary Hash Representations. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10445