Ghavamzadeh, M., & Lazaric, A. (2021). Conservative and Greedy Approaches to Classification-Based Policy Iteration. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 914-920. https://doi.org/10.1609/aaai.v26i1.8304