(1)
Wang, N.; Deng, J.; Jia, M. Cycle-Consistency Learning for Captioning and Grounding. AAAI 2024, 38, 5535-5543.