Zhang, S.-X., Wang, H., Huang, D., Li, X., Zhu, X., & Yin, X.-C. (2026). VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12726–12734. https://doi.org/10.1609/aaai.v40i15.38269