Zhang, P., Lai, Z., Chen, W., Wu, X., & Kong, H. (2026). FaNe: Towards Fine-Grained Cross-Modal Contrast with False-Negative Reduction and Text-Conditioned Sparse Attention. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12681–12689. https://doi.org/10.1609/aaai.v40i15.38264