[1] M. Ghavamzadeh and A. Lazaric, “Conservative and Greedy Approaches to Classification-Based Policy Iteration,” AAAI, vol. 26, no. 1, pp. 914–920, Sep. 2021.