Ghavamzadeh, M., and A. Lazaric. “Conservative and Greedy Approaches to Classification-Based Policy Iteration”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 26, no. 1, Sept. 2021, pp. 914-20, doi:10.1609/aaai.v26i1.8304.