Zhang, Xuezhou, Shubham Bharti, Yuzhe Ma, Adish Singla, and Xiaojin Zhu. 2021. “The Sample Complexity of Teaching by Reinforcement on Q-Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 35 (12):10939-47. https://ojs.aaai.org/index.php/AAAI/article/view/17306.