Crowley, M., & Poole, D. (2011). Policy Gradient Planning for Environmental Decision Making with Existing Simulators. Proceedings of the AAAI Conference on Artificial Intelligence, 25(1), 1323–1330. https://doi.org/10.1609/aaai.v25i1.7796