[1]
Y. Li, F. Guerin, and C. Lin, “LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction”, AAAI, vol. 38, no. 17, pp. 18600–18607, Mar. 2024.