(1) Jiang, C.; Ye, W.; Xu, H.; Ye, Q.; Yan, M.; Zhang, J.; Zhang, S. TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-Training. AAAI 2024, 38, 2489-2497.