Simão, T. D., and M. T. J. Spaan. “Safe Policy Improvement With Baseline Bootstrapping in Factored Environments”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 4967-74, doi:10.1609/aaai.v33i01.33014967.