Johnson, N., Cabrera, Ángel A., Plumb, G., & Talwalkar, A. (2023). Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 11(1), 65-76. https://doi.org/10.1609/hcomp.v11i1.27548