Zhang, X., Bharti, S., Ma, Y., Singla, A., & Zhu, X. (2021). The Sample Complexity of Teaching by Reinforcement on Q-Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 10939-10947. https://doi.org/10.1609/aaai.v35i12.17306