Crowley, Mark, and David Poole. “Policy Gradient Planning for Environmental Decision Making With Existing Simulators”. Proceedings of the AAAI Conference on Artificial Intelligence 25, no. 1 (August 4, 2011): 1323–1330. Accessed May 7, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/7796.