Zhang, Shi-Xue, Hongfa Wang, Duojun Huang, Xin Li, Xiaobin Zhu, and Xu-Cheng Yin. “VCapsBench: A Large-Scale Fine-Grained Benchmark for Video Caption Quality Evaluation”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 15 (March 14, 2026): 12726–12734. Accessed May 17, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/38269.