Shen, H., and T.-H. Huang. “How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels.” Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, vol. 8, no. 1, Oct. 2020, pp. 168-72, doi:10.1609/hcomp.v8i1.7477.