Order-Preserving Dimension Reduction for Multimodal Semantic Embedding

Authors

  • Chengyu Gong New York University
  • Gefei Shen Harvard University
  • Luanzheng Guo Pacific Northwest National Laboratory
  • Nathan R. Tallent Pacific Northwest National Laboratory
  • Dongfang Zhao University of Washington

DOI:

https://doi.org/10.1609/aaai.v40i17.38496

Abstract

Searching for the k-nearest neighbors in multimodal data retrieval is computationally expensive, particularly due to the inherent difficulty in comparing similarity measures across different modalities. Recent advances in multimodal machine learning address this issue by mapping data into a shared embedding space; however, the high dimensionality of these embeddings (hundreds to thousands of dimensions) presents a challenge for time-sensitive vision applications. This work proposes Order-Preserving Dimension Reduction (OPDR), aiming to reduce the dimensionality of embeddings while preserving the ranking of KNN in the lower-dimensional space. One notable component of OPDR is a new measure function to quantify KNN quality as a global metric, based on which we derive a closed-form map between target dimensionality and key contextual parameters. We have integrated OPDR with multiple state-of-the-art dimension-reduction techniques, distance functions, and embedding models; experiments on a variety of multimodal datasets demonstrate that OPDR effectively retains recall high accuracy while significantly reducing computational costs.

Downloads

Published

2026-03-14

How to Cite

Gong, C., Shen, G., Guo, L., Tallent, N. R., & Zhao, D. (2026). Order-Preserving Dimension Reduction for Multimodal Semantic Embedding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(17), 14765–14773. https://doi.org/10.1609/aaai.v40i17.38496

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management I