Rezaeifar, S., R. Dadashi, N. Vieillard, L. Hussenot, O. Bachem, O. Pietquin, and M. Geist. “Offline Reinforcement Learning As Anti-Exploration”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 7, June 2022, pp. 8106-14, doi:10.1609/aaai.v36i7.20783.