Johnson, N., Cabrera, Á. A., Plumb, G. and Talwalkar, A. (2023) “Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms”, Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 11(1), pp. 65-76. doi: 10.1609/hcomp.v11i1.27548.