Cohen, A., L. Yu, and R. Wright. “Diverse Exploration for Fast and Safe Policy Improvement”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, https://ojs.aaai.org/index.php/AAAI/article/view/11758.