1.
Lu Y, Zhang Z, Yuan C, Li P, Wang Y, Li B, et al. Set Prediction Guided by Semantic Concepts for Diverse Video Captioning. AAAI [Internet]. 2024 Mar. 24 [cited 2026 May 13];38(4):3909-17. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/28183