Neighborhood-Adaptive Structure Augmented Metric Learning

Authors

  • Pandeng Li University of Science and Technology of China
  • Yan Li Kuaishou
  • Hongtao Xie University of Science and Technology of China
  • Lei Zhang Kuaishou

DOI:

https://doi.org/10.1609/aaai.v36i2.20025

Keywords:

Computer Vision (CV)

Abstract

Most metric learning techniques typically focus on sample embedding learning, while implicitly assume a homogeneous local neighborhood around each sample, based on the metrics used in training ( e.g., hypersphere for Euclidean distance or unit hyperspherical crown for cosine distance). As real-world data often lies on a low-dimensional manifold curved in a high-dimensional space, it is unlikely that everywhere of the manifold shares the same local structures in the input space. Besides, considering the non-linearity of neural networks, the local structure in the output embedding space may not be homogeneous as assumed. Therefore, representing each sample simply with its embedding while ignoring its individual neighborhood structure would have limitations in Embedding-Based Retrieval (EBR). By exploiting the heterogeneity of local structures in the embedding space, we propose a Neighborhood-Adaptive Structure Augmented metric learning framework (NASA), where the neighborhood structure is realized as a structure embedding, and learned along with the sample embedding in a self-supervised manner. In this way, without any modifications, most indexing techniques can be used to support large-scale EBR with NASA embeddings. Experiments on six standard benchmarks with two kinds of embeddings, i.e., binary embeddings and real-valued embeddings, show that our method significantly improves and outperforms the state-of-the-art methods.

Downloads

Published

2022-06-28

How to Cite

Li, P., Li, Y., Xie, H., & Zhang, L. (2022). Neighborhood-Adaptive Structure Augmented Metric Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1367-1375. https://doi.org/10.1609/aaai.v36i2.20025

Issue

Section

AAAI Technical Track on Computer Vision II