Simão, T. D., & Spaan, M. T. J. (2019). Safe Policy Improvement with Baseline Bootstrapping in Factored Environments. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 4967-4974. https://doi.org/10.1609/aaai.v33i01.33014967