(1)
Ghavamzadeh, M.; Lazaric, A. Conservative and Greedy Approaches to Classification-Based Policy Iteration. AAAI 2021, 26, 914-920.