[1]
Z. Zhao, “Heterogeneous Prompt-Guided Entity Inferring and Distilling for Scene-Text Aware Cross-Modal Retrieval”, AAAI, vol. 39, no. 10, pp. 10537–10545, Apr. 2025.