Lee, Sangho, Il Yong Chun, and Hogun Park. “MAMS: Model-Agnostic Module Selection Framework for Video Captioning”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 5 (April 11, 2025): 4535-4543. Accessed April 27, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32478.