Ghavamzadeh, Mohammad, and Alessandro Lazaric. 2021. “Conservative and Greedy Approaches to Classification-Based Policy Iteration”. Proceedings of the AAAI Conference on Artificial Intelligence 26 (1):914-20. https://doi.org/10.1609/aaai.v26i1.8304.