Zhang, Shi-Xue, Hongfa Wang, Duojun Huang, Xin Li, Xiaobin Zhu, and Xu-Cheng Yin. 2026. “VCapsBench: A Large-Scale Fine-Grained Benchmark for Video Caption Quality Evaluation”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (15):12726-34. https://doi.org/10.1609/aaai.v40i15.38269.