(1)
Lu, Y.; Zhang, Z.; Yuan, C.; Li, P.; Wang, Y.; Li, B.; Hu, W. Set Prediction Guided by Semantic Concepts for Diverse Video Captioning. AAAI 2024, 38, 3909-3917.