Johnson, N., Á. A. Cabrera, G. Plumb, and A. Talwalkar. “Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms.” Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, vol. 11, no. 1, Nov. 2023, pp. 65-76, doi:10.1609/hcomp.v11i1.27548.