Ryu, H., Kang, S., Kang, H., & Yoo, C. D. (2021). Semantic Grouping Network for Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2514-2522. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16353