Bonet, Blai, and Hector Geffner. “Action Selection for MDPs: Anytime AO* Versus UCT”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 26, no. 1, Sept. 2021, pp. 1749-55, doi:10.1609/aaai.v26i1.8369.