[1]
Z. Wang, W. Wu, G. Wang, G. Ye, and Z. Cheng, “MetaEval: Measuring the Discrimination of Benchmarks for Efficient LLM Evaluation”, AAAI, vol. 40, no. 40, pp. 33773–33781, Mar. 2026.