Eriksson, Maria, et al. “Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation.” Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, vol. 8, no. 1, Oct. 2025, pp. 850-64, doi:10.1609/aies.v8i1.36595.