Verma, Deepali, Arya Haldar, and Tanima Dutta. “Leveraging Weighted Cross-Graph Attention for Visual and Semantic Enhanced Video Captioning Network”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 2 (June 26, 2023): 2465–2473. Accessed May 10, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/25343.