[1]
Y. Wang, “Enhancing Fine-Grained Vision-Language Pretraining with Negative Augmented Samples”, AAAI, vol. 39, no. 8, pp. 8060–8068, Apr. 2025.