Tong, Tony Cheng, Sirui He, Zhiwen Shao, and Dit-Yan Yeung. “G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 7 (April 11, 2025): 7419–7427. Accessed May 12, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32798.