Ge, Shiping, Qiang Chen, Zhiwei Jiang, Yafeng Yin, Liu Qin, Ziyao Chen, and Qing Gu. “Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 3 (April 11, 2025): 3113–3121. Accessed May 8, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32320.