Rezaeifar, Shideh, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, and Matthieu Geist. “Offline Reinforcement Learning As Anti-Exploration”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 7 (June 28, 2022): 8106-8114. Accessed May 1, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20783.