Johnson, N. (2023) “Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms”, Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 11(1), pp. 65–76. doi: 10.1609/hcomp.v11i1.27548.