Issakkimuthu, M., Fern, A., & Tadepalli, P. (2018). Training Deep Reactive Policies for Probabilistic Planning Problems. Proceedings of the International Conference on Automated Planning and Scheduling, 28(1), 422-430. https://doi.org/10.1609/icaps.v28i1.13873