Lu, Y. (2024) “Set Prediction Guided by Semantic Concepts for Diverse Video Captioning”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), pp. 3909–3917. doi: 10.1609/aaai.v38i4.28183.