Flach, P. (2019) “Performance Evaluation in Machine Learning: The Good, the Bad, the Ugly, and the Way Forward”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 9808-9814. doi: 10.1609/aaai.v33i01.33019808.