Order-Preserving Dimension Reduction for Multimodal Semantic Embedding
DOI:
https://doi.org/10.1609/aaai.v40i17.38496Abstract
Searching for the k-nearest neighbors in multimodal data retrieval is computationally expensive, particularly due to the inherent difficulty in comparing similarity measures across different modalities. Recent advances in multimodal machine learning address this issue by mapping data into a shared embedding space; however, the high dimensionality of these embeddings (hundreds to thousands of dimensions) presents a challenge for time-sensitive vision applications. This work proposes Order-Preserving Dimension Reduction (OPDR), aiming to reduce the dimensionality of embeddings while preserving the ranking of KNN in the lower-dimensional space. One notable component of OPDR is a new measure function to quantify KNN quality as a global metric, based on which we derive a closed-form map between target dimensionality and key contextual parameters. We have integrated OPDR with multiple state-of-the-art dimension-reduction techniques, distance functions, and embedding models; experiments on a variety of multimodal datasets demonstrate that OPDR effectively retains recall high accuracy while significantly reducing computational costs.Published
2026-03-14
How to Cite
Gong, C., Shen, G., Guo, L., Tallent, N. R., & Zhao, D. (2026). Order-Preserving Dimension Reduction for Multimodal Semantic Embedding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(17), 14765–14773. https://doi.org/10.1609/aaai.v40i17.38496
Issue
Section
AAAI Technical Track on Data Mining & Knowledge Management I