Rezaeifar, S., Dadashi, R., Vieillard, N., Hussenot, L., Bachem, O., Pietquin, O. and Geist, M. (2022) “Offline Reinforcement Learning as Anti-exploration”, Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), pp. 8106-8114. doi: 10.1609/aaai.v36i7.20783.