Zhang, X., S. Bharti, Y. Ma, A. Singla, and X. Zhu. “The Sample Complexity of Teaching by Reinforcement on Q-Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, May 2021, pp. 10939-47, doi:10.1609/aaai.v35i12.17306.