Garcelon, E., Ghavamzadeh, M., Lazaric, A. and Pirotta, M. (2020) “Improved Algorithms for Conservative Exploration in Bandits”, Proceedings of the AAAI Conference on Artificial Intelligence, 34(04), pp. 3962-3969. doi: 10.1609/aaai.v34i04.5812.