Zhang, Xuezhou, Shubham Bharti, Yuzhe Ma, Adish Singla, and Xiaojin Zhu. 2021. “The Sample Complexity of Teaching by Reinforcement on Q-Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 35 (12):10939-47. https://doi.org/10.1609/aaai.v35i12.17306.