[1]
B. Bonet and H. Geffner, “Action Selection for MDPs: Anytime AO* Versus UCT”, AAAI, vol. 26, no. 1, pp. 1749-1755, Sep. 2021.