Lin, K., Gan, Z., & Wang, L. (2021). Augmented Partial Mutual Learning with Frame Masking for Video Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(3), 2047-2055. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/16301