(1)
Wang, Z.; Wu, W.; Wang, G.; Ye, G.; Cheng, Z. MetaEval: Measuring the Discrimination of Benchmarks for Efficient LLM Evaluation. AAAI 2026, 40, 33773-33781.