[1] T. C. Tong, S. He, Z. Shao, and D.-Y. Yeung, “G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o,” AAAI, vol. 39, no. 7, pp. 7419–7427, Apr. 2025.