[1]
B. Bonet and H. Geffner, “Action Selection for MDPs: Anytime AO* Versus UCT”, AAAI, vol. 26, no. 1, pp. 1749–1755, Sep. 2021.