An Optimal Transport Approach to Deep Metric Learning (Student Abstract)

Authors

  • Jason Xiaotian Dou Department of Electrical and Computer Engineering, University of Pittsburgh
  • Lei Luo Department of Electrical and Computer Engineering, University of Pittsburgh
  • Raymond Mingrui Yang Department of Electrical and Computer Engineering, Carnegie Mellon University

DOI:

https://doi.org/10.1609/aaai.v36i11.21604

Keywords:

Optimal Transport, Metric Learning, Image Retrival, Clustering

Abstract

Capturing visual similarity among images is the core of many computer vision and pattern recognition tasks. This problem can be formulated in such a paradigm called metric learning. Most research in the area has been mainly focusing on improving the loss functions and similarity measures. However, due to the ignoring of geometric structure, existing methods often lead to sub-optimal results. Thus, several recent research methods took advantage of Wasserstein distance between batches of samples to characterize the spacial geometry. Although these approaches can achieve enhanced performance, the aggregation over batches definitely hinders Wasserstein distance's superior measure capability and leads to high computational complexity. To address this limitation, we propose a novel Deep Wasserstein Metric Learning framework, which employs Wasserstein distance to precisely capture the relationship among various images under ranking-based loss functions such as contrastive loss and triplet loss. Our method directly computes the distance between images, considering the geometry at a finer granularity than batch level. Furthermore, we introduce a new efficient algorithm using Sinkhorn approximation and Wasserstein measure coreset. The experimental results demonstrate the improvements of our framework over various baselines in different applications and benchmark datasets.

Downloads

Published

2022-06-28

How to Cite

Dou, J. X., Luo, L., & Yang, R. M. (2022). An Optimal Transport Approach to Deep Metric Learning (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 36(11), 12935-12936. https://doi.org/10.1609/aaai.v36i11.21604