Shen, H., and T.-H. Huang. “How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels”. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, vol. 8, no. 1, Oct. 2020, pp. 168-72, https://ojs.aaai.org/index.php/HCOMP/article/view/7477.