[1] Rezaeifar, S., Dadashi, R., Vieillard, N., Hussenot, L., Bachem, O., Pietquin, O. and Geist, M. 2022. Offline Reinforcement Learning as Anti-exploration. Proceedings of the AAAI Conference on Artificial Intelligence. 36, 7 (Jun. 2022), 8106-8114. DOI:https://doi.org/10.1609/aaai.v36i7.20783.