(1) Rezaeifar, S.; Dadashi, R.; Vieillard, N.; Hussenot, L.; Bachem, O.; Pietquin, O.; Geist, M. Offline Reinforcement Learning as Anti-Exploration. AAAI 2022, 36, 8106-8114.