(1) Li, Y.; Guerin, F.; Lin, C. LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction. AAAI 2024, 38, 18600-18607.