Xiang, T., Li, Y., Liu, L., Zhi, H., Chen, C., Du, Q., & Tan, M. (2026). FAM: Fine-Grained Alignment Matters in Multimodal Embedding Learning with Large Vision-Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27046–27054. https://doi.org/10.1609/aaai.v40i32.39918