[1]
T. D. Simão and M. T. J. Spaan, “Safe Policy Improvement with Baseline Bootstrapping in Factored Environments”, AAAI, vol. 33, no. 01, pp. 4967-4974, Jul. 2019.