Xiang, Tianhang, Yirui Li, Lizhao Liu, Hongyan Zhi, Chuanshen Chen, Qing Du, and Mingkui Tan. 2026. “FAM: Fine-Grained Alignment Matters in Multimodal Embedding Learning With Large Vision-Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (32): 27046–54. https://doi.org/10.1609/aaai.v40i32.39918.