Asymmetric Mutual Alignment for Unsupervised Zero-Shot Sketch-Based Image Retrieval

Authors

  • Zhihui Yin, Xidian University
  • Jiexi Yan, Xidian University
  • Chenghao Xu, Xidian University
  • Cheng Deng, Xidian University

DOI:

https://doi.org/10.1609/aaai.v38i15.29588

Keywords:

ML: Transfer, Domain Adaptation, Multi-Task Learning; CV: Multi-modal Vision

Abstract

In recent years, many methods have been proposed for zero-shot sketch-based image retrieval (ZS-SBIR), a practical problem in many applications. In real-world scenarios, however, training data with the same distribution as the test data cannot be obtained, and labels for the training data are usually unavailable. To tackle this issue, we focus on a new problem, unsupervised zero-shot sketch-based image retrieval (UZS-SBIR), in which the available training data are unlabeled and the training and testing categories do not overlap. In this paper, we introduce a new asymmetric mutual alignment (AMA) method comprising a self-distillation module and a cross-modality mutual alignment module. First, we conduct self-distillation to extract feature embeddings from unlabeled data. Because little information is available in the unsupervised setting, we employ the cross-modality mutual alignment module to further uncover the underlying intra-modality and inter-modality relationships in the unlabeled data, and exploit these correlations to align the feature embeddings of the image and sketch domains. Meanwhile, the feature representations are enhanced by intra-modality clustering relations, leading to better generalization to unseen classes. Moreover, we adopt an asymmetric strategy to update the teacher and student networks. Extensive experimental results on several benchmark datasets demonstrate the superiority of our method.
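
The self-distillation step and the asymmetric teacher/student update mentioned in the abstract can be illustrated with a toy sketch. The abstract does not state the update rule, so the code below assumes a common self-distillation recipe: the student is trained by gradient descent on a cross-entropy against the teacher's soft targets, while the teacher follows the student through an exponential moving average. The linear heads, temperatures, momentum value, and all function names are hypothetical, and the cross-modality mutual alignment module is not covered. This is not the authors' implementation.

```python
import numpy as np


def softmax(logits, temperature):
    """Row-wise softmax with a temperature."""
    z = logits / temperature
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def distill_step(w_student, w_teacher, view_a, view_b,
                 lr=0.1, momentum=0.99, t_student=0.1, t_teacher=0.04):
    """One self-distillation step on linear heads (assumed toy setup)."""
    # The teacher sees one view and produces soft targets; it receives no gradient.
    targets = softmax(view_a @ w_teacher, t_teacher)
    # The student sees the other view of the same samples.
    probs = softmax(view_b @ w_student, t_student)
    loss = -np.mean(np.sum(targets * np.log(probs + 1e-12), axis=1))

    # Analytic gradient of the mean cross-entropy w.r.t. the student weights.
    grad_logits = (probs - targets) / (t_student * len(view_b))
    grad_w = view_b.T @ grad_logits

    # Asymmetric update (assumption): gradient step for the student,
    # exponential moving average of the student for the teacher.
    new_student = w_student - lr * grad_w
    new_teacher = momentum * w_teacher + (1.0 - momentum) * new_student
    return new_student, new_teacher, loss


# Toy data: two noisy "views" of the same unlabeled feature vectors.
rng = np.random.default_rng(0)
features = rng.normal(size=(8, 16))
view_a = features + 0.05 * rng.normal(size=features.shape)
view_b = features + 0.05 * rng.normal(size=features.shape)

w_student = rng.normal(scale=0.1, size=(16, 4))
w_teacher = w_student.copy()
w_student, w_teacher, loss = distill_step(w_student, w_teacher, view_a, view_b)
```

Using a lower temperature for the teacher than for the student sharpens the targets, which is a frequent choice in self-distillation; whether AMA does the same is not stated in the abstract.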

Published

2024-03-24

How to Cite

Yin, Z., Yan, J., Xu, C., & Deng, C. (2024). Asymmetric Mutual Alignment for Unsupervised Zero-Shot Sketch-Based Image Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16504-16512. https://doi.org/10.1609/aaai.v38i15.29588

Issue

Section

AAAI Technical Track on Machine Learning VI