Topin, N., Milani, S., Fang, F. and Veloso, M. (2021) “Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods”, Proceedings of the AAAI Conference on Artificial Intelligence, 35(11), pp. 9923-9931. doi: 10.1609/aaai.v35i11.17192.