[1]
X. Zhang, S. Bharti, Y. Ma, A. Singla, and X. Zhu, “The Sample Complexity of Teaching by Reinforcement on Q-Learning”, AAAI, vol. 35, no. 12, pp. 10939-10947, May 2021.