Transitive Hashing Network for Heterogeneous Multimedia Retrieval

Authors

  • Zhangjie Cao, Tsinghua University
  • Mingsheng Long, Tsinghua University
  • Jianmin Wang, Tsinghua University
  • Qiang Yang, Hong Kong University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v31i1.10487

Keywords:

Deep Hashing, Transitive Hashing

Abstract

Hashing is widely applied to large-scale multimedia retrieval because of its storage and retrieval efficiency. Cross-modal hashing enables efficient retrieval of items in one modality from a database in response to a query in another modality. Existing work on cross-modal hashing assumes that heterogeneous relationships across modalities are available for learning to hash. This paper relaxes this strict assumption by requiring heterogeneous relationships only in an auxiliary dataset that differs from the query and database domains. We design a novel hybrid deep architecture, the transitive hashing network (THN), that jointly learns cross-modal correlation from the auxiliary dataset and aligns the data distribution of the auxiliary dataset with those of the query and database domains, thereby generating compact transitive hash codes for efficient cross-modal retrieval. Comprehensive empirical evidence validates that the proposed THN approach yields state-of-the-art retrieval performance on standard multimedia benchmarks, namely NUS-WIDE and ImageNet-YahooQA.
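
To illustrate why compact binary hash codes make retrieval efficient, the following is a minimal sketch of Hamming-distance ranking over hash codes. It is not the THN model and does not learn anything: the function names and the 8-bit codes are hypothetical, chosen only for illustration, and stand in for codes that a cross-modal hashing method would produce for a query in one modality and database items in another.

```python
from typing import List


def hamming_distance(a: int, b: int) -> int:
    """Count the bits that differ between two hash codes stored as integers."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query_code: int, database_codes: List[int]) -> List[int]:
    """Return database indices ordered by Hamming distance to the query.

    Python's sort is stable, so items at equal distance keep their
    original index order.
    """
    return sorted(
        range(len(database_codes)),
        key=lambda i: hamming_distance(query_code, database_codes[i]),
    )


# Hypothetical 8-bit codes (assumed values, not taken from the paper):
# one query code, e.g. from a text query, and four database codes,
# e.g. from images.
query = 0b10110010
database = [0b10110011, 0b01001101, 0b10100010, 0b11111111]

ranking = rank_by_hamming(query, database)
print(ranking)
```

Each comparison is a single XOR followed by a bit count, which is why short binary codes keep both storage and query cost low compared with real-valued features.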

Published

2017-02-10

How to Cite

Cao, Z., Long, M., Wang, J., & Yang, Q. (2017). Transitive Hashing Network for Heterogeneous Multimedia Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 31(1). https://doi.org/10.1609/aaai.v31i1.10487